FEBRUARY, 1958 


The Journal of Educational 
Psychology 


CONTENTS


Patterns of Personal Problems of Adolescent Girls
RICHARD E. SCHUTZ
Readability Level and Differential Test Performance: A Language Revision of the Study of Values
JEROME LEVY
The Attitudes Students Assign to Their Teacher
NORMAN M. CHANSKY
A Further Note on Basal Metabolism and Academic Performance
MARY E. YARBROUGH AND HAROLD G. McCURDY
An Experimental Prognostic Test for Remedial Readers
RICHARD W. WOODCOCK
Effect of an Audio-Visual Phonics Aid in the Intermediate Grades
CAROLYN LUSER, EILEEN STANTON, AND CHARLES I. DOYLE
The Differential Effect of Manifest Anxiety on Test Performance
EVAN W. PICKREL
Some Socioeconomic Correlates of Academic Aptitude
KHOSSROW MOHANDESSI AND PHILIP J. RUNKEL
JAMES G. COOPER
The Effect of Child on Attitudes Toward Parent-Child
FRANK COSTIN


AMERICAN PSYCHOLOGICAL ASSOCIATION


Vol. 49 No. 1


The Journal of Educational Psychology 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 


Entered as second-class matter under the Act of March 3, 1879.


Raymond G., Editor
Syracuse University
Advisory Editors
J. Cronbach                                   T. Newland
University of Illinois                        University of Illinois
W. Harris                                     H. H. Remmers
University of Wisconsin                       Purdue University
Ernest R. Hilgard                             Julian C.
Stanford University                           University of Wisconsin
Nicholas Hobbs                                J. B. Stroud
George Peabody College for Teachers           State University of Iowa
E. Jones                                      Donald E. Super
University of California                      Teachers College, Columbia University
Irving Lorge                                  Percival M. Symonds
Teachers College, Columbia University         Teachers College, Columbia University
Dorothea                                      George G.
Fordham University                            Syracuse University
Arthur C. Hoffman, Managing Editor
Onn, Circulation Manager
Sarah Womack, Editorial Assistant
The Journal of Educational Psychology publishes investigations of problems
on the ... of school subjects; experimental studies of learning;
the development of attitudes, personality, social relations, and vocational orienta-
tion ... methods. The ... and re-
search techniques in cross-sectional and developmental ...
Manuscripts and correspondence on editorial matters should be addressed to the Editor.
All manuscripts should be submitted in ...
and must be prepared to conform to the style and procedure ...
Manual of the American Psychological Association. Because of ...
... orders of back ... address ...
... postage. Other claims for undelivered copies must be made within four months of ...
Annual subscription: $8.00 (Foreign $8.50). Single copies, $1.50.
PUBLISHED BIMONTHLY BY:


THE JOURNAL OF 
EDUCATIONAL PSYCHOLOGY 


Volume 49 February, 1958 Number 1 


EDITORIAL NOTE 


This is the first issue of the first complete vol- 
ume of the Journal of Educational Psychology 
to be published by the American Psychological 
Association and the first issue under a new editor. 
Delays and other matters incident to the transi- 
tion of the journal to both a new publisher and a 
new editor have resulted in certain temporary de- 
partures from usual policy and some delay in the 
publication of first issues. Contributors and read-
ers will note in this and the next several issues,
for example, certain inconsistencies in style and
certain departures from standard practice of pub- 
lishing manuscripts in order of their receipt. In 
the future, contributors should follow the style 
prescribed by the Publication Manual of the 
American Psychological Association. 

R. G. K. 


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


PATTERNS OF PERSONAL PROBLEMS OF ADOLESCENT GIRLS¹


RICHARD E. SCHUTZ²


Teachers College, Columbia University 


Numerous investigators have deter- 
mined the frequency with which various 
problems are listed by given samples of 
adolescents (4). However, in tabulating 
the problems it has been necessary to rely 
on a priori systems of classification. The 
customary procedure has been to cate- 
gorize the problems by activity or func-
tional areas; e.g., school, home, health,
etc.

The purpose of the present study was 
to determine the pattern or structure un- 
derlying the personal problems which 
adolescents recognize and are willing to 
report on a youth problems inventory. 
This pattern was investigated by extract- 
ing homogeneous clusters from a sample 
of 156 items selected from the inventory. 


PROCEDURE 


The inventory used in the study was the 
Billett-Starr Youth Problems Inventory, 
Senior Level (2), a check list intended to 
provide a means of systematically identi- 
fying the personal problems of individual 
adolescents. The 441 items which make up 
the Inventory include problems mentioned 
in the compositions and free responses of 
several large samples of high school stu- 
dents which the authors obtained in de-
veloping the instrument. The items are
organized into 11 areas designated as fol-
lows:

1. Physical Health, Fitness, and Safety
2. Getting Along with Others
3. Boy-Girl Relationships
4. Home and Family Life
5. Personal Finance
6. Interests and Activities
7. School Life
8. Heredity
9. Planning for the Future
10. Mental-Emotional Health and Fitness
11. Morality and Religion

¹ Based on a dissertation submitted in partial fulfillment of the requirements for
the Ph.D. at Columbia University. The investigation was carried out under the
helpful direction of Robert L. Thorndike. The writer wishes to thank Roger T.
Lennon and the staff of the Division of Test Research and Service, World Book Co.,
for making the study possible.

² Now at Arizona State College, Tempe, Arizona.


The cluster analysis was based on the 
responses of 500 girls in Grades 10 and 11 
in two Pinellas County, Florida, high 
schools who took the Inventory as part of 
the national standardization program in 
May 1956. The schools are three-year high 
schools and had a segregated white en-
rollment at the time of the study. The
Inventory was administered in regular 
classrooms by regular teachers. The In- 
ventories were signed by the students. 

The basic technique of analysis was that 
described by Loevinger, Gleser, and Du-
Bois (6) for deriving clusters which have
maximum reliability as estimated by 
Kuder-Richardson Formula 20. Each 
cluster is obtained by starting with a triad 
of items having the highest covariance and 
adding items in succession, adding always 
the item for which the ratio of the sum 
of covariance with the items already in 
the cluster is a maximum. Items are added 
to the cluster until no more items remain 
which will increase this ratio. The process 
is repeated on the residual pool of items 
to form the second and subsequent clus- 
ters. 
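
The cluster-building steps described above can be sketched in code. This is a minimal illustration, not the procedure of Loevinger, Gleser, and DuBois as published: it assumes 0/1 item responses, uses the Kuder-Richardson Formula 20 value itself as the quantity to increase when an item is added (the text states the criterion only as a covariance ratio), and all function names are hypothetical.

```python
from itertools import combinations


def kr20(rows):
    """Kuder-Richardson Formula 20 for a list of 0/1 response rows."""
    n, k = len(rows), len(rows[0])
    p = [sum(r[i] for r in rows) / n for i in range(k)]
    sum_pq = sum(pi * (1 - pi) for pi in p)  # sum of item variances p*q
    totals = [sum(r) for r in rows]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    if var_total == 0:
        return 0.0  # reliability is undefined without score variance
    return (k / (k - 1)) * (1 - sum_pq / var_total)


def covariance(rows, i, j):
    """Covariance of two 0/1 items: p_ij - p_i * p_j."""
    n = len(rows)
    p_i = sum(r[i] for r in rows) / n
    p_j = sum(r[j] for r in rows) / n
    p_ij = sum(r[i] * r[j] for r in rows) / n
    return p_ij - p_i * p_j


def grow_cluster(rows):
    """Start from the triad with the highest summed covariance, then add
    items one at a time while the cluster's KR-20 keeps increasing."""
    k = len(rows[0])
    triad = max(
        combinations(range(k), 3),
        key=lambda t: sum(covariance(rows, a, b) for a, b in combinations(t, 2)),
    )
    cluster = list(triad)
    best = kr20([[r[i] for i in cluster] for r in rows])
    while True:
        remaining = [j for j in range(k) if j not in cluster]
        if not remaining:
            break
        trial = {
            j: kr20([[r[i] for i in cluster + [j]] for r in rows])
            for j in remaining
        }
        j_best = max(trial, key=trial.get)
        if trial[j_best] <= best:
            break  # no remaining item improves the cluster
        cluster.append(j_best)
        best = trial[j_best]
    return cluster, best
```

To form second and subsequent clusters, the same function would be called again on the items left out of the first cluster.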


1 
» 


RICHARD E. SCHUTZ 


A sample of 156 items was selected to 
be included in the cluster analysis (8, pp. 
57-64). The items selected have the fol- 
lowing characteristics: (a) they are in- 
cluded in both the Junior and Senior levels 
of the Inventory; (b) they were rated as 
“very serious” or “moderately serious”
problems by a panel of 20 guidance spe- 
cialists; (c) each was marked by at least 
5% of the Ss in the sample. An attempt 
was made to make the sample of items 
representative of the 11 areas of the In- 
ventory and to include as many items as 
possible which have counterparts in other 
published problems check lists. 

The Inventory attempts to get at the 
intensity of a student’s problems by allow- 
ing him to differentiate between those 
which bother him “some” and those which 
bother him “very much.” For the present 
analysis each S’s “some” and “much” re- 
sponses were combined into a single cate- 
gory. 

Each S’s responses were multiple-
punched on an IBM card, and the 156
by 156 co-occurrence matrix was prepared 
using the counting sorter. The figures in 
the co-occurrence matrix were converted 
to percentages and the variance-covari- 
ance matrix prepared. The cluster analy- 
sis was performed, and a check was made 
on the factorial purity and reliability of 
the obtained clusters. 
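
The conversion from co-occurrence counts to a variance-covariance matrix can be illustrated as follows. This is a sketch under the assumption that each cell of the co-occurrence matrix holds the number of Ss who marked both items, with the diagonal holding the number who marked each single item; the function name is hypothetical.

```python
def covariance_from_counts(co_counts, n_subjects):
    """Turn a square matrix of co-occurrence counts into a
    variance-covariance matrix for 0/1 items.

    Off-diagonal cells: p_ij - p_i * p_j.
    Diagonal cells:     p_i - p_i**2, i.e. the item variance p_i * q_i.
    """
    k = len(co_counts)
    p = [[co_counts[i][j] / n_subjects for j in range(k)] for i in range(k)]
    return [[p[i][j] - p[i][i] * p[j][j] for j in range(k)] for i in range(k)]


# Hypothetical example: 10 Ss, item A marked by 5, item B by 4, both by 3.
example = covariance_from_counts([[5, 3], [3, 4]], 10)
```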


RESULTS


Three clusters were extracted from the 
pool of 156 items. Eighty-three items are 
included in Cluster I, 16 in Cluster II, and
17 in Cluster III. An abbreviated Cluster 
I, consisting of 37 items, was formed by 
eliminating 35 items which correlated less 
than .40 with the complete cluster and 11
items which nearly duplicated another 
item in the cluster; e.g., the two items, 
“I’m often restless,” and “I’m restless
most of the time.” The items in the ab- 
breviated Cluster I and in Clusters II and 
III are shown in Tables 1-3. The items are 
arranged in order of the magnitude of the 


point biserial correlation of each item 
with its cluster. The area and item num- 
ber within the area are indicated in the 
first column of each table. The Kuder- 
Richardson Formula 20 reliabilities of the 
clusters and their intercorrelations are 
shown in Table 4. 

The nature of a cluster must be deter- 
mined by examining the items to discover 
the general attribute they seem to hold in 
common. The items in Cluster I cover a 
broad area, coming from eight areas of
the Inventory. The cluster appears to re- 
flect a general feeling of personal anxiety 
and insecurity. 

The items in Cluster II are currently 
classified under seven different area head- 
ings in the Inventory. They seem to in- 
volve a feeling of nervous tension concern- 
ing relationships with other persons. This 
cluster is the least homogeneous of the 
three, and its correlation with Cluster I 
is nearly as high as its reliability. Both 
Clusters I and II reflect personal anxiety. 
While the items in Cluster II did not have 
enough in common with the items in Clus- 
ter I to be included in the more homoge- 
neous general cluster, they shared sufficient 
common variance to form another with 
lower reliability. 

Cluster III is the only cluster that does 
not cut across the functional area organi- 
zation of the Inventory to any great ex- 
tent. Fifteen of the 17 items in the cluster 
come from Area IV of the Inventory 
headed “Home and Family Life.” The 
items all represent some kind of difficulty 
in getting along with parents. 

If a cluster is factorially pure, all of its 
common factor variance should be ac- 
counted for by a single centroid factor. 
The ratio of the first factor variance to 
the common factor variance thus provides 
a basis for evaluating the factorial purity 
of a cluster. 
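
The purity index described above reduces to a ratio of sums of squared loadings. The sketch below assumes a table of factor loadings is already available (one row per item, first column the first centroid factor); the loadings shown are invented for illustration and are not taken from the study.

```python
def first_factor_share(loadings):
    """Share of common factor variance carried by the first factor.

    `loadings` is a list of rows, one per item, each row holding that
    item's loadings on the extracted factors (first factor first).
    """
    first = sum(row[0] ** 2 for row in loadings)
    common = sum(value ** 2 for row in loadings for value in row)
    return first / common


# Hypothetical two-factor solution for a three-item cluster.
share = first_factor_share([[0.8, 0.2], [0.6, 0.0], [0.7, -0.1]])
```

A value near 1.0 would indicate a cluster dominated by a single factor.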

A complete centroid analysis (9) was performed independently on the
items in the abbreviated Cluster I and in Clusters II and III. The
highest correlation coefficient in each column of the matrix was used
as the communality estimate, communalities being re-estimated by this
method for every residual matrix. Factoring was considered complete
when both Tucker’s Phi and Coombs’ criterion (3) indicated that the
last significant factor had been extracted. Three factors were
extracted from Cluster I, three from Cluster II, and two from Cluster
III. First factors account for 80%, 69%, and 85%, respectively, of the
common factor variance of the clusters.


TABLE 1
CLUSTER I

Item No.   r     Item

10-38            People don’t understand me.
10-10     .57    I’m afraid of making mistakes.
10-15     .55    I’m often restless.
10-43     .51    I worry about what others say.
10-37     .51    I feel I’m not wanted.
10-17     .50    I’m disgusted with myself (dislike myself very much).
10-19     .49    I need someone to give me advice.
8-3       .49    I would like to be able to do something well.
          .49    I don’t understand myself.
11-10            Many times I don’t know what is right and what is wrong.
10-39     .48    People talk about me behind my back.
10-2      .48    I feel uncertain (unsure) about everything.
10-34     .47    I spend too much time daydreaming.
10-3      .46    I need to learn to depend on myself.
10-35     .45    I feel sorry for myself.
10-46     .45    I get excited too easily.
11-6      .45    I’m sometimes troubled by immoral (bad) thoughts.
7-5       .45    I wonder if I’ll pass.
10-32     .44    I don’t get out and go after what I want.
9-18      .44    I wonder if I’m taking the right subjects.
7-44      .44    I’m afraid to take tests.
10-58     .44    I’d like to know how to get rid of a bad habit.
4-46      .44    I’m unhappy at home.
7-62      .43    Some teachers never encourage or help me.
6-1       .43    I seldom have anything interesting to do.
7-85      .43    I would like to know how to get along with certain teachers.
8-2       .43    I wonder what my real mental ability is.
10-13     .42    I wonder what my future will be.
10-1      .42    I’m confused by the way things change.
2-50      .42    I feel lonely most of the time.
4-40      .42    I’m afraid to tell my (father) (mother) when I’ve done something wrong.
10-42     .42    I’m blamed for things that aren’t my fault.
2-52      .41    I find it hard to make friends.
10-30     .41    I don’t know how to (pay attention) (work or study hard).
11-5      .41    I often tell lies.
10-45     .40    I’m bothered by people who find fault with me.
10-47     .40    I can’t control my temper.

To investigate the extent to which the reliability of the clusters is
dependent on chance factors, the Kuder-Richardson Formula 20
reliability of each cluster was computed, based on the results of a new
sample of 73 Ss selected from the same population as the original
sample. The reliability coefficients for the new sample were as
follows: Cluster I, .94; abbreviated Cluster I, .91; Cluster II, .63;
Cluster III, .83.

The significance of the difference between each of the coefficients for
the new sample and the original group was computed using Fisher’s z
transformation. The difference for Cluster II is significant at the
.001 level. There is no significant difference between the coefficients
for the other clusters.


TABLE 2
CLUSTER II

Item No.   Item

2-11       I’m nervous when I talk to people.
2-15       I’m not good at talking with people.
7-38       I’m nervous in front of the class.
1-19       I get tired easily.
2-10       (I’m afraid) (I don’t like) to meet people.
2-39       I want others to like me.
1-21       I’m always nervous.
3-1        I don’t understand (boys) (girls).
2-6        I’m not good-looking.
6-10       I would rather be alone.
6-9        I get tired from too much activity.
6-11       I spend too much time on (radio) (television) (movies).
1-22       I need to know more about sex (body changes at my age) (new body functions).
10-40      I’m afraid I seem conceited (stuck-up).
9-25       I’m not sure whether I should go to college.
2-7        I (don’t have) (don’t know how to pick) the right clothes.


TABLE 3
CLUSTER III

Item

My (father) (mother) is always criticizing (blaming) (nagging) me.
I can’t discuss things with my (father) (mother).
My (father) (mother) is always expecting too much of me.
I’m often the cause of family quarrels (parents argue about things I do).
I’m thinking of leaving home.
My (father) (mother) has little or no interest in what I do.
I sometimes lie to my (father) (mother) to get permission to do something.
My (father) (mother) is often nervous and irritable.
My (father) (mother) never asks my opinion about anything important to the family.
I dislike my (father) (mother) very much.
I don’t agree with my (father) (mother) about my out-of-school activities.
I’m sometimes ashamed of things my parents do or say.
My (father) (mother) pries into my private affairs.
My (brother) (sister) is always causing me trouble.
I don’t get an allowance.
I often “stretch the truth” when I tell something.
There’s too much drinking in our home.


TABLE 4
CLUSTER INTERCORRELATIONS AND KUDER-RICHARDSON RELIABILITIES

Cluster    I (83)   I (37)   II      III

I (83)     (.94)    .98      .67     .60
I (37)              (.90)    .63     .52
II                           (.71)   .34
III                                  (.81)
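
A comparison of two coefficients by Fisher’s z transformation, as mentioned above, can be sketched as follows. The study does not report the exact computation, so this assumes the usual large-sample form in which each coefficient is transformed and the difference is divided by a standard error based on the two sample sizes; the function names and the example figures are hypothetical.

```python
import math


def fisher_z(r):
    """Fisher's z transformation of a correlation-type coefficient."""
    return math.atanh(r)


def critical_ratio(r1, n1, r2, n2):
    """Difference between two transformed coefficients divided by its
    standard error, sqrt(1/(n1 - 3) + 1/(n2 - 3))."""
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (fisher_z(r1) - fisher_z(r2)) / se


def two_sided_p(cr):
    """Two-sided normal-curve probability for a critical ratio."""
    return math.erfc(abs(cr) / math.sqrt(2.0))


# Hypothetical comparison: r = .85 in a sample of 200 vs. r = .70 in a sample of 60.
cr = critical_ratio(0.85, 200, 0.70, 60)
p = two_sided_p(cr)
```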

DISCUSSION OF RESULTS


The findings of the study have implica- 
tions for both the theory and measure- 
ment of adolescent problems. While the 
study was not designed to test the validity 
of the various theories of adolescent prob- 
lems that have been proposed (1, 5, 7), 
it does provide evidence concerning the 
way in which problems cluster or “go to- 
gether.” 

The cluster structure does not corre- 
spond closely to any of the theoretical 
frameworks which have been proposed for 
classifying adolescent problems. The com- 
position of Cluster I suggests that a single 
dimension of personal anxiety underlies 
many of the manifest problems which 
theorists have used several dimensions to 
explain. 

The fact that two of the clusters cut 
across several areas of the Inventory in- 
dicates that the classifying of items in 
problems check lists into the traditional 
functional or activity categories is in large 
part an arbitrary procedure. However, 
even though the conventional rubrics do 
not all represent true functional unities, 
teachers and counselors may still find the 
categories helpful. The subdivisions often 
suggest programs of action related to the 
kinds of services that schools and other 
social agencies are equipped to provide. 
Thus, the traditional area organization 
often serves the purpose of suggesting foci 
of therapeutic action. 

Although no data are yet available con- 


cerning the predictive validity of the clus- 
ters, they provide a potential means of 
screening adolescents in need of psycho- 
logical help. They have high face validity 
and are reasonably pure factorially. A 
study of predictive validity is required to 
determine the extent to which they are 
actually related to personal adjustment. 


SUMMARY 


A cluster analysis was performed on 156 
selected items from the Billett-Starr 
Youth Problems Inventory, Senior Level, 
based on the responses of 500 adolescent 
girls. Three clusters were extracted and 
designated as follows: Cluster I: General
personal anxiety and insecurity; Cluster
II: Tension concerning relations with
others; and Cluster III: Difficulties in
getting along with parents. The implica-
tions of the cluster structure for the the-
ory and measurement of adolescent prob-
lems were briefly discussed.

REFERENCES 
1. Ausubel, D. P. Problems of adolescent
adjustment. Nat. Assoc. sec. Sch. Prin.
Bull., 1950, 34 (167), 1-84.

2. Billett, R. O., & Starr, I. S. Billett-Starr
Youth Problems Inventory. Yonkers-
on-Hudson, N. Y.: World Book, 1956.

3. Fruchter, B. Introduction to factor anal-
ysis. New York: Van Nostrand, 1954.

4. Grant, B. Survey of studies on problems
of adolescents. Calif. J. sec. Educ., 1953,
28, 293-297.

5. Havighurst, R. J. Human development
and education. New York: Longmans,
Green, 1953.

6. Loevinger, Jane, Gleser, G. C., &
DuBois, P. H. Maximizing the discrimi-
nating power of a multiple-score test.
Psychometrika, 1953, 18, 309-317.

7. Luchins, A. S. On the theories and prob-
lems of adolescence. J. genet. Psychol.,
1954, 85, 47-63.

8. Schutz, R. E. Patterns of personal prob-
lems of adolescent girls. Unpublished
doctoral dissertation, Columbia Univer.,
1957.

9. Thurstone, L. L. Multiple factor analy-
sis. Chicago: Univer. of Chicago Press,
1947.


Received August 10, 1957.


.67 
.63 .52 
71) | .34 
Il 34 (.81) 


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


READABILITY LEVEL AND DIFFERENTIAL TEST 
PERFORMANCE: A LANGUAGE REVISION 
OF THE STUDY OF VALUES 
JEROME LEVY¹
Denver University 


Psychologists have recently become in- 
terested in the “readability” of material; 
that is, how the verbal difficulty of written 
material affects the communication of 
ideas. Hebb and Bindra (5), Ogdon (7), 
and Stevens and Stone (8) have pub- 
lished material relevant to this question. 
These writers have been concerned with 
readability in discursive writings; e.g.,
textbooks. The important question, how- 
ever, of how the readability level of a 
psychological test might affect the score 
which a subject with a defined level of 
verbal competence achieves has not 
hitherto been directly investigated. It ap- 
pears to the writer that this is a question 
of considerable importance to sound psy- 
chometric practice. The present study was 
designed to investigate this question. 

The Allport-Vernon Study of Values, 
1951 Revision (1), was selected as an ap- 
propriate instrument. Since its introduc- 
tion in 1931 and following its revision in 
1951, the Study of Values has been em- 
ployed in researches in personality theory, 
and for clinical evaluation and guidance 
purposes, and has proved quite useful in 
these areas. Many writers have noted, 
however, that the level of language em- 
ployed in the Study of Values is quite 
complex and difficult. Gough (4), for ex- 
ample, writes: “The language used is too 
academic and involved for use in groups 
very far removed from a scholastic en- 
vironment.” Allport notes: “The scale is 
designed primarily for use with college 
students or with adults who have had col- 

*Now Chief Psychologist, Division of 


Mental Health, Colorado State Department 
of Public Health. 


lege (or equivalent) education” (1). Al- 
though it is not explicitly stated, one sur- 
mises that the basis of this view is the 
recognition of the verbal complexity of the 
test, particularly in terms of vocabulary 
usage. If, however, verbal difficulty is the 
limiting factor as seems suggested, then 
the Study of Values may not be a valid 
test even for all college students. Classes 
in remedial reading, basic communication, 
and other related areas in every university 
attest to the large numbers of students 
deficient in these verbal skills. 

The present study represents an at- 
tempt to produce a modification of the 
Study of Values congruent in all respects 
with the 1951 Revision, except that it em- 
ploys less difficult language. Such a modi- 
fication may be useful in two ways: if it 
can be demonstrated equivalent to the 
1951 Revision it may be validly used with 
Ss for whom the 1951 Revision is valid, 
and may also permit extension of the test 
to populations hitherto considered inap- 
propriate (i.e., noncollege level Ss). Sec- 
ondly, it may be utilized in demonstrating 
the effects of differential vocabulary abil- 
ity on test scores. 


PROBLEM 


There are four major phases to this re-
search: (a) actual construction of the
modification; (b) a test of the meaning 
equivalence of the items of the modifica- 
tion and the 1951 Revision; (c) a demon- 
stration that the language level of the 
modification is indeed lower than that of 
the 1951 Revision; and (d) a demonstra-
tion that there will be differences in per-
formance between the two forms for Ss
whose vocabulary ability is below that in-
herent in the 1951 Revision which will be
significantly greater than the between- 
form differences for Ss whose vocabulary 
ability is equal to that of the 1951 Revi- 
sion. This is the major focus of the in- 
vestigation and the primary hypothesis to 
be tested. Consequent hypotheses for test- 
ing are: (a) For a high vocabulary ability 
group the between-form performance dif- 
ferences will be of a minimal nonsignifi- 
cant nature; (b) For a low vocabulary 
ability group there will be a large and 
statistically significant difference in per- 
formance on the two forms; and (c) For
a group of Ss intermediate on the vocabu- 
lary criterion, the between-form differ- 
ences will fall between that for the high 
and that for the low groups. 


METHOD 


Construction of the modification. Of the 
45 items in the 1951 Revision, all but one 
were reworded. Items were constructed 
which attempted to maintain the struc- 
ture and meaning of the equivalent origi- 
nal items, but which were worded much 
more simply. Where a choice of words 
with equivalent meanings was to be made, 
the simpler word was always chosen. 

Equivalence of meaning. Both the 1951 
Revision and the modified form were given 
to a group of judges with a rather hetero- 
geneous background in psychology. These 
judges were unfamiliar with the Study of 
Values. They were supplied with the defini- 
tions of the six predominant personality 
types as they appear in the Manual of 
Directions (1) so that they would have 
clear referents for each scale. They were 
presented the two forms in mixed order 
and asked to judge which type of person 
would answer each question with each al- 
ternative choice. For example, in the item 
“Assuming you have sufficient ability, 
would you prefer to be: (a) a banker; (b) 
a politician?” which of Allport’s types 
would choose alternative (a), which would 


choose (b)? The judges also matched the 
items of both forms by number; the items 
of each form having been randomized. 

Readability. In order to evaluate the 
hypothesis that the language level of the 
modified form was simpler, a test of 
“readability” was employed. This is the 
Flesch formula (3). 
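
For readers unfamiliar with the measure, the sketch below assumes that the formula referred to is Flesch’s Reading Ease formula in its commonly published form, and that grade levels are read from Flesch’s conversion chart. The word, sentence, and syllable counts are supplied by the caller; the function names and the band labels are illustrative assumptions rather than details given in this paper.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease: higher scores mean easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_band(score):
    """Approximate school-grade band for a Reading Ease score."""
    if score >= 90:
        return "5th grade"
    if score >= 80:
        return "6th grade"
    if score >= 70:
        return "7th grade"
    if score >= 60:
        return "8th to 9th grade"
    if score >= 50:
        return "10th to 12th grade"
    if score >= 30:
        return "college"
    return "college graduate"


# Hypothetical passage: 100 words, 5 sentences, 150 syllables.
score = flesch_reading_ease(100, 5, 150)
band = flesch_band(score)
```

Shorter sentences and fewer syllables per word both raise the score, which is the direction in which the modified form was rewritten.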

Between-form differences. A group of 
157 young male Air Force Ss was ad- 
ministered the 1951 Revision, the present 
modification, and the Diagnostic Reading 
Test, Survey Section (2). The two forms 
of the Study of Values were given in bal- 
anced order of administration, with the 
Diagnostic Reading Test coming between 
them for all Ss. The vocabulary scale of
the Survey Section was taken as the pri-
mary criterion for the selection of three
groups: a Low group of 22 Ss whose vo-
cabulary scores placed them at or below
the lower quartile of the eighth grade, a
High group of 23 Ss with vocabulary
scores at or above the upper quartile of
the twelfth grade, and a Middle group of
20 Ss with intermediate vocabulary scores.
Thus there resulted three groups of ap-
proximately equal size, with no overlap on
the vocabulary criterion, whose tested vo-
cabulary ability closely corresponded to
the readability of the two forms of the 
Study of Values. The two forms were 
scored for each S, and a “deviation score” 
was also computed. This is merely the 
difference between S’s score on a scale on 
one form and his score on the same scale 
on the alternate form, either positive or 
negative (sign being dropped), and 
summed for all six scales. 
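
The deviation score just described can be written out directly. The sketch below uses invented scale scores for a single S; the dictionary layout and names are assumptions made for illustration.

```python
SCALES = ("Theoretical", "Economic", "Aesthetic", "Social", "Political", "Religious")


def deviation_score(original, modified):
    """Sum over the six scales of the absolute difference between
    an S's score on the 1951 Revision and on the modified form."""
    return sum(abs(original[s] - modified[s]) for s in SCALES)


# Hypothetical scores for one S on each form.
original = {"Theoretical": 45, "Economic": 40, "Aesthetic": 35,
            "Social": 40, "Political": 42, "Religious": 38}
modified = {"Theoretical": 41, "Economic": 43, "Aesthetic": 35,
            "Social": 41, "Political": 40, "Religious": 40}
score = deviation_score(original, modified)
```

Because the sign is dropped, gains on one scale cannot cancel losses on another.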


RESULTS


The results of the judgments indicate that, in the opinion of these
judges, the revised items do ask the same thing as the original items.
The judges made the same categorizations of the original and the
revised items; i.e., there was not a statistically significant
difference between the type judgments on the pairs of items. The
results of the item-to-item match show that, of the total of 506
matches the judges made, 495 (98%) were matched correctly.

The Flesch formula yielded a “Reading Ease Score” of 52.7 for the 1951
Revision, placing it at the twelfth grade level; the score of 72.6 for
the modified form is at the seventh grade level.


TABLE 1
t TESTS OF ORDER FOR THE HIGH, MIDDLE, AND LOW GROUPS

Scales: Theoretical, Economic, Aesthetic, Social, Political, Religious
Groups: Low (a), Middle (b), High (c)
Cell values are largely illegible in this copy; surviving entries: 1.07, .26, .52.

a  t.05 = 2.086, df = 20
b  t.05 = 2.101, df = 18
c  t.05 = 2.080, df = 21


TABLE 2
ANALYSIS OF VARIANCE TO DETERMINE SIGNIFICANCE OF BETWEEN-FORM
DIFFERENCES FOR THE LOW GROUP, AS COMPARED TO BETWEEN-FORM
DIFFERENCES FOR THE HIGH GROUP

Source     Sum of Squares    df    Mean Square    F

Between        609.158        2      304.579      3.955
Within       4,774.780       62       77.103
Total        5,383.938       64

Mean Differences

Low Group    Middle Group    High Group
                22.150         17.043


Since some of the Ss in each of the 
three groups took the forms in original- 
modified order, and others took them in 
modified-original order, t tests were com- 
puted to determine whether the order in 
which the forms were taken has a signi- 
ficant effect on the between-form differ- 
ences. As may be seen from Table 1, none 
of these t’s was significant at the .05
criterion level. It was therefore possible 
to combine all of the Ss within each of 
the three groups, and to treat each of the 
groups as a single unit for further statis- 
tical evaluation. 

As a test of the main hypothesis—that 
is, that the Low group would show sig- 
nificantly greater between-form differ- 
ences than would the High group—an 
analysis of variance was performed uti- 
lizing the deviation scores. The F value 
of 3.955 which was obtained, as shown 
in Table 2, is significant at, and beyond, 
the .05 level of confidence. This clearly
substantiates the major hypothesis. The 
primary characteristic on which these two 
groups differ is that of diagnosed vocab- 
ulary ability, and the significantly 
greater between-form differences in the 


8 
| | | | ong 
=| 
oo 
— 


READABILITY AND TEST PERFORMANCE 9 


Low group seems clearly related to their 
low vocabulary level. The group means 
of the difference scores, which are also 
contained in Table 2, bear out the hypo- 
thesis that as the more difficult form 
moves away from the Ss’ level of vocab- 
ulary ability they make increasingly 
different scores than they do on the form 
which is within their level of competence. 
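
The F ratio in Table 2 follows from dividing the between-groups mean square by the within-groups mean square. The sketch below shows that arithmetic and a plain one-way analysis of variance on raw scores. The between-groups sum of squares of 609.158 is inferred here as the total minus the within-groups sum of squares (5,383.938 − 4,774.780); the function names and the small example groups are hypothetical.

```python
def f_from_sums(ss_between, df_between, ss_within, df_within):
    """F ratio from sums of squares and degrees of freedom."""
    return (ss_between / df_between) / (ss_within / df_within)


def one_way_f(groups):
    """One-way analysis of variance F ratio for a list of score lists."""
    scores = [x for g in groups for x in g]
    grand_mean = sum(scores) / len(scores)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return f_from_sums(ss_between, len(groups) - 1,
                       ss_within, len(scores) - len(groups))


# Values as read from Table 2, with the between-groups sum inferred.
f_table = f_from_sums(609.158, 2, 4774.780, 62)

# Hypothetical deviation scores for three small groups.
f_example = one_way_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
```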

The groups show a good deal of con- 
sistency in mean scale values from one 
form to another, as indicated by the data 
in Table 3. On the Theoretical scale, the 


Low group scores lowest on both forms, 
the Middle group is intermediate on both 
forms, and the High group scores highest 
on both forms. The Aesthetic scale shows 
the reverse order, with the same relative 
consistency on both forms. The t tests were 
computed to determine the significance 
of change in performance on each of the 
six seales for the High and Low groups: 
these data are presented in Table 4. 
The hypotheses employed in this in- 
vestigation state, in effect, that the two 
forms of the Study of Values are parallel 


TABLE 3 
Mean ScaLe Vatvues ACHIEVED BY THE Low, MippLeE aNp Groups 


ON THE 1951 Re 


Political Religious 


1951 Revision 
Mean 
Modified Mean 


1951 Revision 
Mean 
Modified Mean 


High 


1951 Revision 
Mean 
Modified Mean 


44.826 
44.304 


30.913 
31.478 


40.260 
40.260 


TABLE 4 
t Tests or SIGNIFICANCE OF DIFFERENCES IN PERFORMANCE BETWEEN 
1951 Revision AND MopiFIcaTION ON Eacu OF THE 


Six ScaLes ror anp Low Groups 


Theoretical Economic 


Aesthetic 


Social Political Religious 


Low Group 


37 
NS 


High Group 


1.55 
NS 


AND THE MODIFICATION 
Group Theoretical Economic Aesthetic Social Po 
Low 
pO 43.136 40.272 34.181 40.454 42.772 39.181 
40.136 42.363 33.954 41.045 42.454 40.045 
Middle 
44.500 43.550 32.350 35.250 43.350 41.000 
44.950 42.150 33.600 36.850 41.300 41.150 
49.347 33.173 41.378 
t 3.86 2.29 | 1.36 1.13 1.77 
P 001 .05 NS NS NS 
t 1.47 1.18 || 2.62 46 1.97 
P NS NS .02 NS NS 


10 


JEROME LEVY 


TABLE 5

Test-Retest Correlation Coefficients for the 1951 Revision and Obtained
Correlation Coefficients Between Performance on the 1951 Revision
and the Modified Form for the High and Low Groups

                               Low Group correla-     High Group correla-
              1951 Revision    tions between 1951     tions between 1951
Scale         (Test-Retest)    Revision and Modified  Revision and Modified
Theoretical       .87               .60                    .76
Economic          .92               .53                    [illegible]
Aesthetic         .90               .60                    .90
Social            .77               .74                    .68
Political         .90               .55                    .89
Religious         .91               .72                    .91

Tests of Significance Between Correlation Coefficients

              High to Low      Low to 1951 Revision   High to 1951 Revision
Scale         Critical Ratio   Critical Ratio         Critical Ratio
Theoretical       1.27              2.68a                  1.35
Economic          2.65a             4.18c                  1.47
Aesthetic         3.27b             3.26b                  0.00
Social             .51               .79                   [illegible]
Political         3.18a             3.57b                   .30
Religious         2.60a             2.59a                  0.00

a .01 level
b .001 level
c .0001 level


forms for the High vocabulary group
where there are not the interfering effects
of vocabulary limitations, and are not
parallel forms for the Low group where
these limitations do exist. If the two forms
are parallel, the correlation between per-
formance on the Modification and on the
1951 Revision should not differ signifi-
cantly from a test-retest correlation of
performance on the 1951 Revision alone.
In other words, if the two forms are
equivalent there should not be a statis-
tically significant difference between the
correlation coefficients obtained by giving
the 1951 Revision twice, and those ob-
tained by correlating performance on the
two forms. Pearson product-moment cor-
relation coefficients were computed to
determine the degree of relationship of
the two forms in the High and Low
groups. To determine whether these r’s
are significantly different from each other
the r-to-z transformation and critical ratio
test of significance was employed. These
data are presented in Table 5. Because
of the impossibility of obtaining test-retest
data on this group of Ss, these data are
taken from the 1951 Revision Manual
(1).


DISCUSSION


It seems reasonable to expect that
people who are highly theoretically ori-
ented are unlikely to have a very low
vocabulary level. Omitting any question
of cause and effect, it does seem likely
that there is a positive relationship be-
tween these areas. People who are theo-
retically oriented are likely to spend a
good deal of time reading in many areas
of knowledge, and in other ways are apt
to develop a well-organized and compre-
hensive vocabulary. Conversely, persons
not theoretically oriented may reasonably


be expected to engage less in these activ- 
ities. What the writer proposes, in effect, 
is that the higher mean Theoretical score 
found on the original form is spuriously 
high. When items are not understood, 
choices are likely to be made on the basis 
of something other than true preference, 
and it is suggested that the higher mean 
for this group on the original form is due 
largely to noncomprehension. When, on 
the other hand, they are presented the 
same alternative choices in language that 
they can understand, they tend to score 
lower. Since as a group they are not highly 
theoretically oriented, the group mean 
score on this scale will fall as the meaning
of the items becomes clear. If this view
is correct, then the modified form is in- 
deed a more valid measure of theoretical 
values in this group. 

The lower Economic scale score for the
Low group on the 1951 Revision is
significant at the .05 level. It is suggested
that this result again reflects the effects of 
noncomprehension. It is hypothesized that 
when choices having to do with economic 
values are presented to Ss in the Low 
group in language that is meaningful to 
them, they score higher on the Economic
scale because this reflects a “true”
underlying orientation in this direction.

The results shown in Table 4 also in- 
dicate that there is one scale on which the 
High group Ss make significantly different 
scores on the two forms: the Social scale. 
Two tentative explanations may be ad- 
vanced for the significantly higher score 
on the modified than on the original form: 
(a) this may be a chance phenomenon, 
or (b) real differences may arise on this 
scale in a highly verbal group as a result
of change in wording. A cross-validational 
study to investigate the meaning of this 
result is necessary. 

Interpretation of the results of this 
study seems simple and straightforward. 
Since none of the between-form correla-
tions for the High group differs signifi-
cantly from the test-retest correlation
coefficients, the modified form is con-
sidered a parallel form for this group. 
For those Ss where there are not the 
vocabulary limitations, the modification 
yields results that do not differ
significantly from those that would be obtained
through the use of the 1951 Revision, 
and the two forms may be considered 
equivalent for this group. However, the 
significant CRs obtained in the com- 
parison for the Low group indicate that 
the modification is not a parallel form 
for this group. Since the major char- 
acteristic on which the groups differ, and 
the basis on which they were selected, 
is differential vocabulary ability, it seems 
clearly evident that it is the vocabulary 
proficiency of the Ss which determines 
differential performance on the two forms. 
If the Ss can understand the language, 
then the modification is a parallel form 
of the Study of Values. If, however, the 
Ss’ language ability is not sufficient to 
cope with the 1951 Revision, the differ- 
ences in performance on the two forms 
are so great as to indicate that they 
represent different tasks. 
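
The comparison of correlation coefficients described above (an r-to-z transformation followed by a critical ratio) can be sketched in a few lines. This is only an illustration: it assumes the conventional standard error for two independent correlations, sqrt(1/(n1 - 3) + 1/(n2 - 3)), which the paper does not state, and the sample sizes are hypothetical placeholders rather than values from the study. The two coefficients are the Economic scale entries of Table 5.

```python
import math


def fisher_z(r):
    """Fisher's r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)


def critical_ratio(r1, n1, r2, n2):
    """Critical ratio for the difference between two independent r's.

    Assumes the conventional standard error sqrt(1/(n1-3) + 1/(n2-3)).
    """
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (fisher_z(r1) - fisher_z(r2)) / se


def two_tailed_p(cr):
    """Two-tailed probability of |CR| under the normal curve."""
    return math.erfc(abs(cr) / math.sqrt(2.0))


# Hypothetical sample sizes (assumptions, not taken from the paper).
N_RETEST, N_GROUP = 50, 50

# Test-retest r (.92) versus Low group between-form r (.53), Economic scale.
cr = critical_ratio(0.92, N_RETEST, 0.53, N_GROUP)
print(round(cr, 2), two_tailed_p(cr) < 0.01)
```

A critical ratio near zero is what the parallel-forms argument expects for the High group; a large ratio, as for most Low group scales, indicates that the between-form correlation falls short of the test-retest value.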

Since the modification is a parallel form 
for highly verbal Ss, it is likely that it 
is as valid for these Ss as is the 1951 
Revision. It is the writer’s opinion that 
the modification is a more valid form for 
low verbal people. Since the level of 
language is brought within the compre- 
hension of these low verbal Ss, random 
response choices based on noncompre- 
hension are likely to be decreased. Re- 
sponses based on an understanding of 
what the question is really asking are 
more likely to reflect truly the S’s ori- 
entation on the dimension being measured. 
External validation studies with different 
occupational groups, with other test
criteria, and with generalized measures 
of personality are necessary for an em- 
pirical demonstration of validity. If on 
further investigation this modification re- 
ceives external validity support, it may 
well serve a useful function in psycho-
logical evaluation. It will provide the
same sort of personality information as 
does the 1951 Revision for highly verbal 
Ss and will permit extension of the test 
to populations hitherto considered inap- 
propriate. 

The factor of verbal level as an im- 
portant variable in pencil-and-paper psy- 
chological test results has, of course, been 
realized for some time. But this has been 
at a rather gross level. This is the first 
study, to this writer’s knowledge, which 
has attempted to manipulate verbal fa- 
cility as an experimental variable and to 
evaluate statistically the effect which this 
has on test performance. With the grow- 
ing concern about readability and com- 
munication in general, it is not likely to 
be the last. A good deal more study re- 
mains to be done with the modification. 
An item analysis is planned for the future. 
A covariance analysis of the relationship 
between differential test performance, ver- 
bal level, and intelligence is likely to 
clarify other important variables. But a 
promising start has been made in pro- 
ducing a meaningfully equivalent, yet less 
verbally complex, form of this popular 
test, and it has been possible to demon- 
strate the significant role which verbal 
facility plays in pencil-and-paper test re- 
sults. 


SUMMARY


This paper reports an attempt to pro- 
duce a meaningfully equivalent, but less 
verbally complex, form of the Study of 
Values, and to demonstrate the significant 
role which language facility plays in de- 
termining score patterns on pencil-and- 
paper tests of personality. Pilot data with
the modified form indicate that it is
judged as asking the same things as the 
Study of Values. Flesch counts of the 
1951 Revision and of this modification 
indicate that they are at the twelfth 
grade level and seventh grade level,
respectively. Three experimental groups
were established: a Low vocabulary
group, a High vocabulary group, and a
Middle group. The two forms were ad- 
ministered to these Ss. Analysis of the 
data indicates that for the High group 
the modification is an equivalent form. 
The Ss in the Low group make signifi- 
cantly different scores on the two forms, 
and the forms are not equivalent for 
this group. The differences in performance 
in this group are attributed to a vocab- 
ulary level inadequate to deal with the 
1951 Revision, and it is suggested that 
for these Ss the modification provides a 
more valid test of value orientations. 
Much additional exploratory work re- 
mains to be done with the modification 
before it may be completely accepted as 
a valid, equivalent form for all Ss. This 
study serves to emphasize the important 
role of language facility in all pencil-and- 
paper personality tests. 


REFERENCES 

1. Allport, G. W., Vernon, P. E., & Lindzey,
G. Study of Values (Rev. ed.): Manual
of Directions. Boston: Houghton Mif-
flin, 1951.

2. Committee on Diagnostic Reading Tests,
Inc. Description of the purposes and
functions of the Diagnostic Reading
Tests. Educ. psychol. Measmt., 1948,
8, 3-14.

3. Flesch, R. How to test readability. New
York: Harper, 1951.

4. Gough, H. G. In O. K. Buros (Ed.),
The fourth mental measurements year-
book. Highland Park, New Jersey:
Gryphon Press, 1953. Pp. 156-157.

5. Hebb, D. O., & Bindra, D. Scientific
writing and the general problem of
communication. Amer. Psychologist,
1952, 7, 569-573.

6. Levy, J. Reducing the language com-
plexity of the Study of Values: A
revision. Unpublished doctoral disser-
tation, Univer. of Denver, 1956.

7. Ogdon, D. P. Flesch counts of eight cur-
rent texts for introductory psychology.
Amer. Psychologist, 1954, 9, 143-144.

8. Stevens, S. S., & Stone, G. Psychological
writing, easy and hard. Amer. Psy-
chologist, 1947, 2, 230-235; Comment,
523-525.


Received July 12, 1957. 


Journal of Educational Psychology
Vol. 49, No. 1, 1958


THE ATTITUDES STUDENTS ASSIGN TO THEIR TEACHER 


NORMAN M. CHANSKY 
Oswego State Teachers College 


How students see and judge their teach- 
ers are behavioral operations which have 
more than theoretical importance. The 
validity of student judgments has never 
been determined, yet research studies in 
this area tacitly assume the factor of 
validity. That different students viewing 
the same instructor assign him varied, 
even contradictory, attitudes warrants in- 
quiry into the factors associated with such 
judging. The purpose of the present study 
was to examine whether attitudes assigned 
to a teacher can in any way differentiate 
between students with authoritarian out- 
looks and those with democratic outlooks. 


PROCEDURE 


After lecturing for three weeks about 
prenatal development, postnatal develop- 
ment, and theories of child development, 
the writer administered the Minnesota 
Teacher Attitude Inventory (MTAI) (3) 
to his classes in Child Psychology. The 
MTAI is a measure of attitude toward 
democratic attitudes and practices in teach- 
ing, based on the F Scale of the authori- 
tarian personality series of studies (1). 
High scores on the MTAI indicate a demo- 
cratic attitude; low scores, an authori- 
tarian attitude. 

After having answered the items of the 
MTAI, the students were asked to write 
on the back of their answer sheets what 
attitudes toward children they thought 
their instructor held. Care had been taken 
during the three introductory weeks of 
the course to avoid controversial issues in 
child psychology. At no time did the in- 
structor consciously cue the students to 
his point of view. 

During the subsequent 12 weeks of the 
course, lectures and course activities cen- 
tered on (a) the law of effect as related 


to the development of behavior in chil- 
dren, (b) factors associated with accept- 
ance of children, and (c) the childhood 
antecedents of self realization. 

At the end of the course, the MTAI 
was readministered and the students were 
asked once again to state the attitude 
toward children their instructor holds. 


RESULTS 


A content analysis of the first set of 
student statements of the attitude which 
the instructor was presumed to hold 
yielded seven distinct categories. These 
categories were: 

(1) freedom of children to manipulate 
the environment 

(2) development of socially precise be- 
haviors in children with punishment for 
deviants (discipline) 

(3) a cold, impersonal attitude toward 
children 

(4) the development of independence in 
children 

(5) respect for children 

(6) warm, friendly, personal relation- 
ship with children 

(7) helping children with learning and 
emotional problems (clinical). Some atti- 
tudes could not be classified and were not 
included in the results. 

The specific statistical tests (See Table 
1) indicate that those students who felt 
their teacher would encourage freedom in 
children received significantly higher 
MTAI scores than any other group. On the 
other hand, those students who saw their 
instructor taking a clinical attitude toward 
children received significantly lower MTAI 
scores than most of the other groups. 
Their scores were not reliably different 
from either the discipline or the impersonal
groups. In addition, the group who looked
upon their instructor as a disciplinarian
received significantly lower MTAI scores
than the freedom, independence, and
personal groups.


TABLE 1

t Ratios for the Instructor Attitudes
Before Exposure to Teacher Attitude

                                 t Ratios for differences between attitude groups
Instructor            Mean       Disci-   Imper-   Independ-
Attitude      Number  MTAI score pline    sonal    ence       Respect  Personal  Clinical
Freedom         16    53.25      4.83b    2.19a    2.39a      2.27a    2.21a     4.33
Discipline      12    15.42               -1.81    -5.02c     1.86     -3.45b    0.78
Impersonal       9    31.77                        0.77       0.02     -0.70     1.70
Independence    12    38.04                                   1.04     0.12      3.50
Respect         12    31.50                                            -0.75     [illegible]
Personal         6    37.50                                                      3.15
Clinical      [number and mean partly illegible in source: "9 66"]

Note.—Degrees of freedom are N1 + N2 - 2.
a Significant at P .05 level.
b Significant at P .01 level.
c Significant at P .001 level.
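
The note to Table 1 gives the degrees of freedom as N1 + N2 - 2, which is consistent with a pooled-variance t ratio between two independent groups. A minimal sketch of that computation follows. The group means and sizes are the Freedom and Discipline entries of Table 1; the standard deviations are hypothetical, since the article does not report them, so the result is not expected to reproduce the tabled value.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Pooled-variance t ratio for two independent groups.

    Returns (t, degrees of freedom), with df = n1 + n2 - 2.
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return (mean1 - mean2) / se, df


# Means and group sizes from Table 1 (Freedom vs. Discipline).
# HYPOTHETICAL_SD is an assumption made only for illustration.
HYPOTHETICAL_SD = 20.0
t, df = pooled_t(53.25, HYPOTHETICAL_SD, 16, 15.42, HYPOTHETICAL_SD, 12)
print(round(t, 2), df)
```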

These statistical tests indicate that, in 
the absence of well-defined cues, students 
projected their own attitudes toward the 
instructor (2). The reasoning behind this 
interpretation was as follows. Since cues 
to the attitude of the teacher toward chil- 
dren were minimized, the necessary ambi- 
guity for projection was present. Further- 
more, when those students who assigned 
such attitudes as freedom, independence, 
respect, usually associated with democ- 
racy, received higher MTAI scores, in- 
dicative of a democratic attitude, and 
when those students who perceived their 
instructor as either a disciplinarian or a 
clinician received lower MTAI scores, in- 
dicative of an authoritarian attitude, the 
interpretation of projection was confirmed. 
The relationship between discipline and 
authoritarianism is not unexpected, but 
why a service attitude such as that which 
a clinician holds received such low MTAI 
scores can be reconciled only if the clini- 
cal attitude represents an expression of 
the deep dependency needs of the authori- 
tarian personality. If this hypothesis be 


substantiated in subsequent studies, it may
be worthwhile investigating whether this 
clinical attitude means sensitivity to the 
needs of children or if it is a subtle way 
of expressing the wish for an anaclitic re- 
lationship with an authority figure. 

Data obtained after exposure to the 
teacher's attitude are reported in Table 2.
Four new categories appear in the content
analysis of the student-assigned attitudes
of the teacher. These are patience, school
learning, no punishment, and understand-
ing. A chi-square test was made to de-
termine whether students assigned dif-
ferent attitudes to their instructor after
he had presented his point of view. The
data yielded a nonsignificant chi-square
of 10.91 (10 df). Any change that took
place can be attributed to chance. A
coefficient of contingency of .46 was also
obtained. These data further suggest that 
there was some consistency in respond- 
ing to the teacher-attitude question on the 
two testings even after exposure to his 
attitude. 
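
As a rough check on the chi-square reported above, the upper-tail probability for 10 degrees of freedom can be computed with the closed-form series that holds for even degrees of freedom. The sketch also shows the usual formula for the coefficient of contingency, C = sqrt(chi2 / (chi2 + N)). The number of students entering the chi-square test is not stated in this passage, so the N used below is a hypothetical value chosen only to illustrate the formula.

```python
import math


def chi_square_p_even_df(chi2, df):
    """Upper-tail probability of chi-square (closed form, even df only)."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive even df")
    half = chi2 / 2.0
    term, total = 1.0, 1.0
    # Sum of (chi2/2)^i / i! for i = 0 .. df/2 - 1
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


def contingency_coefficient(chi2, n):
    """Pearson's coefficient of contingency, C = sqrt(chi2 / (chi2 + N))."""
    return math.sqrt(chi2 / (chi2 + n))


p = chi_square_p_even_df(10.91, 10)   # chi-square and df as reported
HYPOTHETICAL_N = 41                   # assumption; N is not given in the text
c = contingency_coefficient(10.91, HYPOTHETICAL_N)
print(round(p, 3), round(c, 2))
```

The probability is well above the .05 level, in line with the statement that the chi-square is nonsignificant.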

TABLE 2

t Ratios for the Instructor Attitudes
After Exposure to Teacher Attitude

                               t Ratios for differences between attitude groups
Instructor         Mean        Pa-     Learn-                    No pun-   Disci-              Under-
Attitude       N   MTAI score  tience  ing     Clinical Respect  ishment   pline    Personal   standing
Freedom        20  56.00       0.95    2.39a   3.40b    1.05     1.16      6.17c    [illegible] 0.05
Patience        5  46.60               1.24    1.53     -0.16    -0.27     3.44     -0.96      -0.65
Learning        8  29.50                       -0.11    -1.25    -1.52     1.50     -1.89      -1.62
Clinical        6  31.00                            [illegible]  -2.33a    2.41a    -3.04      -1.72
Respect        12  47.77                                         -0.07     4.19     -0.91      -0.60
No punishment   7  48.71                                                   5.09c    -0.94      -0.54
Discipline     16  10.[illegible]
Personal       14  [illegible]
Understanding  [illegible]

[The source header also lists an Independence column whose entries are not
recoverable. Remaining entries legible in the source but not assignable to
cells: -3.46b, 0, -5.43c.]

Note.—Degrees of freedom are N1 + N2 - 2.
a Significant at P .05 level.
b Significant at P .01 level.
c Significant at P .001 level.


The results of the retest suggest there
was greater variability than there had
been in the first testing. While the freedom
group had received significantly higher
MTAI scores than any other group during
the pretest, the retest results indicate that
the freedom group can only be distin- 
guished from the emphasis on school learn- 
ing, clinical, and the discipline groups. 
The emphasis on school learning is a 
characteristic of the authoritarian per- 
sonality according to the authors of the 
MTAI. The results of the retest indicate 
further that these three authoritarian 
groups received significantly lower MTAI 
scores than many of the democratic at- 
titude groups. The clinical group attained 
significantly lower scores than the freedom, 
no punishment, and personal contact 
groups; the school learning group received lower
scores than the freedom group; and the
discipline group received significantly 
lower scores than all groups except the 
school learning. It is interesting to note 
that the clinical group was more demo- 
cratic in attitude than the discipline group. 

Again, students assigning democratic 


attitudes to the teacher received higher 
MTAI scores, indicative of a democratic 
attitude; students assigning antidemocratic
attitudes received lower MTAI
scores, indicative of an antidemocratic
attitude.


SUMMARY AND CONCLUSIONS


The purpose of this study was to de- 
termine whether students’ authoritarian 
or democratic attitudes toward children 
were in any way associated with their 
perception of their teacher. 

Because the students had not been cued to
their teacher's attitude during the first
phase of the experiment, the differences
between those who saw their instructor as
one who would encourage freedom,
understand, or respect children and those
who saw him as a disciplinarian or helper
of the helpless were interpreted in light of
the projective hypothesis. Students, without
their awareness, assigned attitudes which 
they themselves held. 

After having been continuously cued to 
the teacher’s attitude thereafter, the ear- 
lier vagueness disappeared experimentally, 
making projection less tenable as an
explanation, but not ruling it out entirely. The
writer believes the students assigning the 
different attitudes to their instructor oper- 
ated within different frames of reference. 
In this study, two frames of reference were 
possible: a democratic and an authori- 
tarian. Within a democratic frame of ref- 
erence, students selectively perceived in 
the teacher evidence for freedom of, pa- 
tience with, understanding of, respect of, 
and no punishment of children. Within 
an authoritarian frame of reference, stu- 
dents saw their instructors as being pri- 
marily interested in discipline, school 
learning, and children with psychological 
problems. 

Certain attitudes which students as- 


sign to their instructor, then, differentiate 
between democratic and authoritarian stu- 
dents. In addition, there is evidence that 
rating of attitudes in another person will 
be influenced to a great degree by the at- 
titude the rater himself holds. Hence, there 
is a constant error in the judgment of 
democratic attitudes in others. Demo- 
cratic raters are apt to give more demo- 
cratic ratings; authoritarian raters are apt 
to give more authoritarian ratings. 


REFERENCES 

1. Adorno, T. W., Frenkel-Brunswik, Else,
Levinson, D. J., & Sanford, R. N.
The authoritarian personality. New
York: Harper, 1950.

2. Chansky, N. M. How students see their
teacher. Ment. Hyg., 1958, 42, 118-120.

3. Cook, W. W., Leeds, C. H., & Callis, R.
Minnesota Teacher Attitude Inventory.
New York: Psychological Corporation,
1951.


Received August 1, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958


THE SCHOOL PROGRESS AND ADJUSTMENT OF 
UNDERAGE AND OVERAGE STUDENTS¹


CLYDE J. BAER 
Kansas City, Missouri, Public Schools 


The purpose of this study was to inves- 
tigate the question of whether or not a 
child who begins school underage experi- 
ences similar problems and achieves the 
same level of development as if he had 
waited a year to enter school. 

In this city school system a chrono- 
logical age of five by November first is 
required for regular entrance into the 
kindergarten. However, children who will 
be five during November or December 
and who score a mental age of 5-0 or 
above on an individual intelligence test 
may be admitted. This study was made 
because some administrators and teachers 
in this school system feel that many of 
those children who enter prior to age five 
are too immature to be in school. How- 
ever, there is no mid-year entrance or 
promotion and children who are not ad- 
mitted must wait a full year to enter 
school. 


PROCEDURE 


Seventy-three children with birthdates 
in November and December were matched 
with 73 children with birthdates in Janu- 
ary and February who were in the same 
school grade and who had entered kinder- 
garten in September of the same year. The 
groups were matched on the bases of in- 
telligence quotient, sex, and, in about two 
thirds of the cases, the school entered. 

During their eleventh year in school
the groups were compared on the bases
of physical size at the time of the study,
grade level attained, number of problems
marked on the Science Research Associ-
ates Youth Inventory, and scores on the
Guilford-Zimmerman Temperament Sur-
vey. From the cumulative records, com-
parisons were made of marks in elemen-
tary and high school subjects, achievement
test scores, teacher rating on personal
traits, and number of absences.

¹ This article is based on the writer's doc-
toral dissertation, the University of Kansas,
1955. Grateful acknowledgement is made to
Henry P. Smith, under whose direction this
work was done.


RESULTS


Table 1 describes the groups studied. 

The overage and underage students were
not significantly² different in intelligence,
as measured by the Revised Stanford-
Binet, Form L, administered at time of
their entrance into kindergarten or during
the regular school year. For the underage
students the range of IQ was from 103 to
127, and for the overage students the
range was from 101 to 126.

After eleven years in school, the over-
age students were significantly taller but
not significantly heavier than the under-
age students. They also had been
significantly³ more successful in maintaining
regular progression from grade to grade
than the underage students.
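
Because the two groups were matched pair by pair, the t test named in the footnote is presumably the matched-pairs form, computed from the differences D between paired scores. A minimal sketch under that assumption follows; the scores are invented for illustration and are not data from the study.

```python
import math


def paired_t(first, second):
    """Matched-pairs t ratio from raw difference scores.

    t = sum(D) / sqrt((N * sum(D^2) - (sum(D))^2) / (N - 1)),
    with N - 1 degrees of freedom.
    """
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    sum_d = sum(diffs)
    sum_d2 = sum(d * d for d in diffs)
    t = sum_d / math.sqrt((n * sum_d2 - sum_d ** 2) / (n - 1))
    return t, n - 1


# Hypothetical matched scores (assumptions for illustration only).
overage_scores = [5, 7, 9, 6]
underage_scores = [4, 5, 8, 3]
t_value, df = paired_t(overage_scores, underage_scores)
print(round(t_value, 2), df)
```

This form is algebraically the same as dividing the mean difference by its standard error, which is often the more familiar way of writing the test.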

During the elementary school years
(kindergarten through Grade eight) the
overage students were marked significantly
higher than the underage students, but the
differences between the overage and under-
age students tended to decrease as higher
grade levels were reached. The girls con-
sistently were marked higher than the
boys of their respective groups, but this
difference showed no identifiable tendency
either to increase or decrease. In the high
school the marks received by the overage
students were significantly higher than
the marks made by the underage students.

² Unless otherwise specified, each signifi-
cant difference reported here was statistically
significant at the .01 per cent level of con-
fidence, and was obtained by the application
of the t test of statistical significance, where

$$t = \frac{\sum D}{\sqrt{\dfrac{N\sum D^{2} - \left(\sum D\right)^{2}}{N - 1}}}$$

³ In this instance the chi-square test of
statistical significance was applied, and in-
dicated a level of confidence between two
and five per cent.


TABLE 1
Number, IQ, Age, Height, Weight, and
Grade of Overage and Underage
Students Studied

               Underage            Overage
               Boys      Girls     Boys      Girls
Number          42        31       [illegible]
Mean IQ        111.24    111.40    [illegible]
Age
  15-5          24       [illegible]
  15-6          18       [illegible]
  16-3         [illegible]
  16-4         [illegible]
Height         5'6"      [illegible]
Weight         [illegible]
Grade
  9            [illegible]
  10           [illegible]
  11           [illegible]


TABLE 2
Mean Ratings on Personal Traits for
Overage and Underage Students

                                        Un-      Over-   Differ-
Trait                                   derage   age     ence
1. Participation in Group Activity      [illegible]
2. Attitude Toward School Regulations   3.84     4.30    .46*
3. Appearance                           4.37     4.64    .27*
4. Dependability                        3.66     4.27    .61*
5. Emotional Stability                  3.71     4.70    .99*
6. Initiative                           3.22     4.01    .79*
7. Cooperativeness                      3.64     4.28    [illegible]

* Significant at the .01 % level of confidence


Achievement test scores reported at
various grade levels during the elementary
school showed that the overage students
achieved significantly higher scores in
reading for Grades three, six, and eight;
in arithmetic for Grades four, six, and
eight; and in social studies for Grade five.
The differences between overage and under-
age students were not significant in spelling for
Grade five; language for Grades five and
eight; and science for Grade seven.

Near the close of each school year
every student is rated by his teacher on
each of seven traits. Ratings recorded for
each grade level from three through eight
for each trait were summed and the mean
taken for each pupil. Table 2 shows the
mean group ratings for overage and under-
age students according to a scale of one
to five, with five being the highest rating.
For all traits the overage students were
rated significantly higher than the under-
age students.

The girls were rated higher than the
boys of their respective groups on all
traits. For three traits, Attitude Toward
School Regulations, Dependability, and
Emotional Stability, the differences be-
tween boys and girls were greater than the
differences between underage and overage.

The number of problems marked by
overage and underage students on a prob-
lem inventory was not significantly differ-
ent. Although the overage and underage
boys tended to mark the same problems,
the underage boys marked more problems
relating to home and family. The over-
age girls marked more problems in the
category dealing with school than in all
the other categories combined. Although
this category was also the most popular
with the underage girls, they marked al-
most as many problems in the section
“After High School.”

The results on the Guilford-Zimmerman
Temperament Survey indicate that the
overage boys tend to be less inclined to be
suspicious or to see personal reference in

the words or actions of others, and the 
overage girls tend to be somewhat more 
socially aggressive, with greater drive and 
energy. All of the mean scores for the girls 
and all but five of the 20 mean scores for 
the boys fall in the average range. 
Reported absences in elementary school 
show little difference between overage and 
underage students. From kindergarten
through Grade eight the total median 
number of absences per year for both 
overage and underage boys was 6.50 days. 
For underage girls the total median was 
8.00 days, and for overage girls it was 
8.50 days. For both overage and underage 
the trend seemed to be for the greatest 
number of absences to occur in the
primary grades, decrease in number during
the intermediate grades, and then increase 
again at about Grade eight, with the girls 
showing the higher rate of increase. 


CONCLUSIONS 


As a group, the overage children made 
better school progress than did the under- 
age children. The overage children, from 


kindergarten through Grade ten, made 
significantly higher marks in subjects, sig- 
nificantly higher scores on achievement 
tests in reading, arithmetic, and social 
studies, were rated significantly higher on 
personal traits by their teachers, and were 
significantly more successful in maintain- 
ing regular progression from grade level 
to grade level. 

That the differences between boys and 
girls were greater than the differences be- 


tween overage and underage for three of 
the personal trait ratings may indicate a 
sex-associated factor in these ratings. The 
overage and underage boys marked es- 
sentially the same problems on a youth 
inventory, but the two groups of girls 
did not show as much agreement on prob- 
lems as did the boys. 

Although there is some evidence that 
the differences between the overage and 
underage students tended to decrease with 
higher grade levels, perhaps this is what 
should be expected since the advantage in 
mental age that the overage group carries 
in the elementary school grades tends to 
decrease as the students get older. 

Before concluding that it would be bet- 
ter for underage children to wait until 
the next year to begin school, it should be 
noted that most of the underage children 
made average school progress. As a group, 
they made average marks in subjects, 
average scores on achievement tests, re- 
ceived average ratings by their teachers 
on personal traits, and did not mark sig- 
nificantly more problems on the problem 
inventory than did the overage students. 

However, it should be remembered that 
both the overage and the underage chil- 
dren studied here were selected on the 
basis of intelligence (average IQ of each
group about 111). Thus, a better than 
average performance may legitimately be 
expected for either group on certain of 
the measures used. 


Received August 4, 1957.




Journal of Educational Psychology
Vol. 49, No. 1, 1958


A FURTHER NOTE ON BASAL METABOLISM 
AND ACADEMIC PERFORMANCE 


MARY E. YARBROUGH 
Meredith College 


AND HAROLD G. McCURDY
University of North Carolina


It is sometimes necessary to conclude
that an attractive hypothesis is incorrect.
In 1947 one of the present authors re-
ported in this Journal a substantial cor-
relation between BMR and the academic
performance of a sample of college women
(2). When his correlation was combined
with the lower, but still positive correla-
tion reported in an earlier study by Pat-
rick and Rowles (4), the resulting r of .24
seemed large enough to justify the specu-
lation that college success might owe some-
thing to basal metabolic rate. It was
stressed at the time, however, that the
hypothesis called for a wider sampling of
the college population. In the light of the
further data which we can now offer, the
Meredith College sample of 1947 appears
as a statistical aberration and the sug-
gested hypothesis untenable, since the true
correlation between the variables at issue
seems to be in the neighborhood of zero.


NEW AND OLD DATA


In Table 1 are presented the data on
which we base our rejection of the old
hypothesis. It speaks for itself, but a few
words regarding the studies which it sum-
marizes will not be out of place.

The studies are listed in the table in
roughly chronological order. It will be
noticed that the information from Om-
wake, Dexter, and Lewis (3) was avail-
able in published form in 1934. It was un-
fortunately overlooked by McCurdy in
1947, but it would not have altered the
suggestion made at that time. These au-
thors state regarding their 72 Agnes Scott
students: “Those making a high scholastic
average tend to have high metabolism, 
but little relationship between poor schol- 
arship and metabolism is evident.” Their 
correlation of .138, when combined with 
the .05 of Patrick and Rowles and the 
.53 of McCurdy, contributes to a correla-
tion (by Fisher's z transformation) of .19 
for 152 individuals, which is significant at 
slightly better than the three per cent 
level. 
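
The pooling of correlations by Fisher's z transformation mentioned above can be sketched as follows. Each r is converted to z, the z values are averaged with weights n - 3, and the mean z is converted back to r; the mean z times the square root of the summed weights serves as a normal deviate for testing the pooled value against zero. The weighting by n - 3 is the conventional choice and is assumed here. The Agnes Scott sample size (72) and the total of 152 come from the text; the split of the remaining 80 cases between the other two studies is a hypothetical placeholder, so the output should not be expected to reproduce the reported .19.

```python
import math


def combine_correlations(studies):
    """Pool independent correlations with Fisher's z.

    studies: list of (r, n) pairs.
    Returns (pooled r, normal deviate for testing pooled r against zero).
    """
    weights = [n - 3 for _, n in studies]
    weighted_sum = sum(w * math.atanh(r) for (r, _), w in zip(studies, weights))
    z_bar = weighted_sum / sum(weights)
    return math.tanh(z_bar), z_bar * math.sqrt(sum(weights))


studies = [
    (0.05, 60),    # Patrick and Rowles; n is hypothetical
    (0.138, 72),   # Omwake, Dexter, and Lewis; n from the text
    (0.53, 20),    # McCurdy (Meredith); n is hypothetical
]
pooled_r, deviate = combine_correlations(studies)
print(round(pooled_r, 2), round(deviate, 2))
```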

The three new sets of data, all inde- 
pendently gathered, are those of Schutte 
at California, McCurdy at North Caro- 
lina, and Yarbrough at Meredith. The 
Schutte dissertation (5) has not been 
published, and the published abstract 
came to our attention quite recently; but 
the author has kindly furnished us with 
the important data which appear in the 
table in a personal communication. 

Concerning the other two studies of re- 
cent date a few details can be given. 
McCurdy at the University of North 
Carolina at Chapel Hill, through the 
kindness of E. McG. Hedgpeth of the 
University Infirmary, secured a large col- 
lection of routine BMR records, of which 
a certain number applied to students. 
Some 560 of these were checked against 
the academic files of the Central Office of 
Records in order to find as many under- 
graduate students as possible who had 
taken courses falling within the central 
liberal arts curriculum. Many were ex- 
cluded because their names did not appear 
in the files, or because they were graduate 
students, or because their curricula were 
irregular. The final yield was 117 cases. 
Point-hour ratio was ascertained for these 


METABOLISM AND ACADEMIC PERFORMANCE 


TABLE 1
SUMMARY OF STUDIES ON BMR AND SCHOLARSHIP

Study                                       Correlation
Patrick & Rowles (Ohio University)              .05
Omwake, Dexter, & Lewis (Agnes Scott)           ..
McCurdy (Meredith)                              .53
Schutte (University of California)             -.06
                                               -.08
McCurdy (UNC)                                   ..
Yarbrough (Meredith)                            ..

[Entries shown as ".." are not legible at this point. Also legible here, but not assignable to a row: +7.3, -13., -12., +10.62 (BMR column) and -.05 (correlation column).]

BMR and point-hour ratio are not exactly comparable from study to study because of
differences in the machines used and in the assignment of points to letter grades.


cases in or near the academic quarter 
when the BMR was taken. These 117 
students ranged in age from 17 to 34; the 
distribution was skewed, and it was de- 
cided to cut off the older students at a 
break in the distribution, leaving a total 
of 102 with an age range of from 17 to 
24, symmetrically distributed around the 
mean age of 21. No significant differences 
appeared between the 65 men and 37 
women of this sample in age, point-hour 
ratio, BMR, or the relations between 
variables, and therefore the results are 
lumped together in the table. It was fully 
realized at the time that the manner of 
assembling these data did not afford a 
clean-cut test of the hypothesis, since errors 
might have crept in at several points; 
but the specific purpose was to discover 
whether the hypothesized relationship was 
strong enough to stand up in the face of 
the sort of random errors which might be 
encountered by a practical-minded college 
dean or physician working upon the same 
hypothesis. The physician who lent the 
BMR records, incidentally, felt that the 
hypothesis was perhaps a realistic one. 
Yarbrough at Meredith, on the other 


hand, attempted to follow strict criteria 
both in sampling the students and in 
taking the BMR readings. The students 
were all in their first year of college, 
taking virtually the same program, and 
being subjected to very similar requirements
by teachers who value good academic
work. Three BMR records were
taken on each girl, and each record was 
carefully inspected for possible technical 
flaws. As in the earlier Meredith study, 
the record yielding the lowest BMR for a 
given individual was utilized in the corre- 
lational analysis. In all essential respects 
the experimental procedures and the na- 
ture of the sample seemed comparable to 
those of the earlier study. But the correla- 
tion between BMR and scholarship was 
definitely much lower. 

It is clear from inspection of Table 1 
that the correlation of .53 in 1947 has to 
be considered as quite exceptional. We can 
think of no explanation except accident of 
sampling. The total weight of the evidence 
supports the view that BMR’s in the nor- 
mal range have little or nothing to do with 
scholarship at the college level. 


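TABLE 1 (continued)

Study                                    N                   BMR              Point-hour ratio
Patrick & Rowles (Ohio University)       52 F                -7.3 ± 8.97      1.50 ± .51
Omwake, Dexter, & Lewis (Agnes Scott)    72 F
McCurdy (Meredith)                       28 F
Schutte (University of California)       556 F
                                         377 M
McCurdy (UNC)                            102 (37 F, 65 M)    -8.50 ± 10.35    1.13 ± .39
Yarbrough (Meredith)                     33 F                -8.30 ± 8.00     1.32 ± .72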


22 MARY E. YARBROUGH AND HAROLD G. MCCURDY 


CONCLUSION AND COMMENTARY 


The available evidence contradicts the 
plausible hypothesis that basal metabolic 
rate might affect academic performance in 
college. The correlation between the physi- 
ological and the intellectual indices used 
appears to be at or near zero. Perhaps 
the persuasiveness of the hypothesis in the 
first place depended entirely too much on
a superficial view of the interrelationships 
of energy, motivation, and thought. 

In psychological circles the null hy- 
pothesis is often regarded with distaste, 
and “negative” results are dreaded so 
much that, it is rumored, some investi- 
gators avoid repeating experiments lest 
chance should be against them the second 
time. If the rumor is true, then some of 
our theories must have very spindly 
foundations. No matter what the signifi- 
cance level of a single result may be, it is 
not sufficient by itself to establish a gen- 
eral working rule. The present summary 
of metabolic studies should make it clear 




that one correlation in a set of six may be 
freakishly different from the rest. 


REFERENCES 


1. Fisher, R. A. Statistical methods for research
workers. Edinburgh: Oliver &
Boyd, 1941.

2. McCurdy, H. G. Basal metabolism and
academic performance in a sample of
college women. J. educ. Psychol., 1947,
38, 363-372.

3. Omwake, K. T., Dexter, E. S., & Lewis,
L. W. The interrelations of certain
physiological measurements and aspects
of personality. Char. & Person., 1934, 3,
64-71.

4. Patrick, J. & Rowles, E. Intercorrelations
among metabolic rate, vital
capacity, blood pressure, intelligence,
scholarship, personality and other measures
on university women. J. appl.
Psychol., 1933, 17, 507-521.

5. Schutte, H. M. An investigation of basal
metabolic rate and college scholarship.
Unpublished doctoral dissertation, Univer.
of California at Berkeley, 1951.


Received August 13, 1957. 


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


AN EXPERIMENTAL PROGNOSTIC TEST 
FOR REMEDIAL READERS 


RICHARD W. WOODCOCK 
Oregon College of Education 


A remedial reading program is usually 
a relatively expensive program since it 
may entail individualized or small group
instruction in addition to the school’s 
regular full-time program. Limitations of 
finances and time usually limit the amount 
of service available and it is desirable to 
utilize some sort of selective procedure in 
choosing cases for remedial instruction. 

This article describes the development 
and evaluation of a test for selecting the 
remedial readers most likely to profit 
from special instruction. For the purposes 
of this discussion, a “remedial reader” is 
defined as any child whose reading 
achievement is significantly retarded be- 
low both his grade placement and his 
capacity or potential for reading achieve- 
ment. In addition, the child in mind is 
specifically lacking in reading skills usu- 
ally developed by the third or fourth 
grade. 

The test is designed to duplicate as 
nearly as possible the “learning-to-read” 
process for remedial readers. It was as- 
sumed that the primary learning task 
of the remedial reader is to learn associa- 
tions between written symbols and fa- 
miliar spoken language. By means of the 
experimental test an attempt has been 
made to provide the remedial reader with 
a controlled learning situation uniquely 
different from reading, yet so similar to 
the process of learning-to-read that per- 
formance on the test might be indicative 
of ability to profit from remedial reading 
instruction. 


DESCRIPTION OF TEST


The experimental test is basically com- 
posed of five short stories written entirely 
with pictures and symbols instead of 


words. Each of these “test stories” is on 
a successively more difficult level and is 
preceded by a practice page which in- 
troduces the new symbols used at that 
level and provides practice with the sym- 
bols in context. The manner of presenting 
the material on these introductory pages 
is similar to that of a reading lesson. 
Figure 1a shows the first group of symbols
and the following line of practice material 
for Level I. A portion of the Level V 
test story is shown in Figure 1b. 

The total vocabulary of the experi- 
mental test consists of 72 symbols. Two 
of these symbols represent the inflections, 
“-ing” and “-s”. The “-s” inflection is
used both as a plural and to denote pos- 
session. 

The vocabulary of the first level con- 
sists of 14 symbols. These symbols repre- 
sent such words as “boy,” “the,” “cat,” 
“is,” “black,” and so forth. Many of the 
symbols at the first level consist of stick 
drawings or simple pictures of the object 
the symbol is intended to represent. Each 
successive level adds approximately 15 new 
symbols to the vocabulary being used in 
the test stories. 

The symbols at the higher levels do 
not provide picture clues. There are some 
phonetic elements common to a number 
of the symbols. However, this is not 
pointed out to the Ss. 

The materials for administering the test 
consist of an answer sheet and a test 
booklet. The test booklet contains ten
pages, two pages for each of the five 
levels, as described above. The answer 
sheet is used by the examiner in recording 
the errors made by the S while reading 
the test stories. 


FIG. 1. (a) First two lines from the Level I introductory page (Boy, Girl, Dog,
Cat, And; Boy and Girl, Cat and Girl, Boy and Dog). (b) First two lines from the
Level V test story (Mother went to a surprise party. So did the boy and girl. It was
a little girl’s birthday so they took presents for her.)

The score on the test is the total
number of errors made on the five test
stories read orally by the S. Hesitations 
of more than approximately five seconds 
(for which the word is supplied by the 
examiner), substitutions, omissions, and 
additions were considered errors. Repeti- 
tions were not so counted in this study. 


PROCEDURE OF THE STUDY


1. The first step in the development of 
the instrument consisted of small-scale 
tryouts and revisions with a series of 
crude tests and procedures. From this 
work evolved the Experimental Prog- 
nostic Test for Remedial Readers de- 
scribed above. 

2. The experimental test was administered
to a complete fourth-grade class


TABLE 1
CORRELATION OF EXPERIMENTAL TEST
ERROR SCORES WITH GATES
READING TESTS
(N = 24)

Vocabulary         -.51
Comprehension      -.65
Speed              -.58
Oral reading       -.48
Composite score    -.59


of 24 pupils at Francis Willard School 
in Eugene, Oregon during March of 1955. 
The resulting data were analyzed for re- 
liability and concurrent validity informa- 
tion. 

3. Next the test was administered to 
26 remedial readers in the Corvallis Public 
Schools for the purpose of evaluating the 
predictive validity. Each of these 26 re- 
medial readers was given the experimental 
test and a series of Gates reading tests. 
Following a four-month period of daily 
individual and small-group instruction 
these same children were given an alter- 
nate form of the reading tests. These data 
were analyzed for predictive validity in- 
formation. 


Concurrent validity. Twenty-four chil- 
dren in a regular fourth-grade classroom 
were administered the Gates Reading Sur- 
vey, the Gates Oral Reading Test, and 
the Experimental Prognostic Test. The 
experimental test was found to correlate 
—.59 with the composite measure of read- 
ing achievement. (Negative correlations 
are based on error scores.) Table 1 pre- 
sents the correlations of the experimental 




test with the four reading achievement 
subtests. These correlations are all signi- 
ficant at the .05 level of confidence. 
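
The statement about significance can be checked against the usual t test for a correlation coefficient. The sketch below is offered as an illustration rather than as the author's procedure: it assumes a two-tailed test of each r against zero with N - 2 = 22 degrees of freedom, for which the tabled .05 critical value of t is 2.074. The function and variable names are hypothetical.

```python
import math

T_CRIT_DF22 = 2.074  # two-tailed .05 critical value of t for 22 df (assumed test)


def t_for_r(r, n):
    """t statistic for testing a Pearson r against zero with n cases."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# Correlations of error scores with the Gates measures, as given in Table 1
table_1 = {
    "Vocabulary": -0.51,
    "Comprehension": -0.65,
    "Speed": -0.58,
    "Oral reading": -0.48,
    "Composite score": -0.59,
}

significant = {name: abs(t_for_r(r, 24)) > T_CRIT_DF22 for name, r in table_1.items()}
for name, flag in significant.items():
    print(f"{name:16s} significant at .05: {flag}")
```

Even the smallest coefficient, -.48 for oral reading, gives a t of about 2.57 under these assumptions, so all five values clear the .05 criterion.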

A mean score of 58.0 errors was made 
on the experimental test by this fourth- 
grade class. The standard deviation of 
scores was found to be 22.8 errors. 

Reliability. The experimental test re- 
liability, as computed by the Kuder- 
Richardson Formula 21, is .91. This value 
probably represents an underestimate 
since the data do not meet the assumption 
of equal difficulty for all items in the 
test. The split-half correlation was .92 
and a reliability of .96 is obtained when 
this value is corrected by the Spearman- 
Brown Formula. These values are prob- 
ably overestimated to a small degree. The 
standard error of measurement, based on 
the corrected split-half reliability, is 4.6 
errors. Since the standard deviation of
scores in this sample is 22.8 errors, about
five times the standard error of measurement,
the test may have good discriminative properties.
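
The two derived figures in this paragraph follow from standard formulas, and the arithmetic can be retraced as below. This is a sketch under stated assumptions: the Spearman-Brown correction is taken in its usual form for doubling test length, and the standard error of measurement is computed from the corrected reliability after rounding to .96, which is what reproduces the reported 4.6. The function names are hypothetical.

```python
import math


def spearman_brown(r_half):
    """Full-length reliability estimated from a split-half correlation."""
    return 2 * r_half / (1 + r_half)


def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)


corrected = spearman_brown(0.92)                 # about .958, reported as .96
sem = standard_error_of_measurement(22.8, 0.96)  # uses the rounded reliability
print(f"corrected reliability = {corrected:.2f}, SEM = {sem:.2f} errors")
```

With the unrounded reliability the standard error would come out nearer 4.7 errors, so the published value appears to rest on the rounded coefficient.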

Predictive validity. Estimates of the 
predictive value of the experimental test 
were obtained from a group of cases re- 
ferred and selected for remedial reading 
instruction. Since the total number of 
cases was relatively small and spread 
among several age levels, it was not pos- 
sible to apply correlational procedures to 
the data which had been collected. An 
alternative method was developed which 
consisted of pairing the remedial cases 
and predicting, on the basis of the ex- 
perimental test scores, which member of 
each pair would show the most gain by 
the end of the training period. In order 
to pair the cases they were grouped ac- 
cording to age levels and each child was 
paired with each other child at the same 
age level if there was a significant dif- 
ference between their scores on the 
Experimental Prognostic Test. Several 
different values were established as “sig- 
nificant differences” in order to observe 
the amount of variance in the predictive 


ability of the test. These significant dif- 
ferences in scores were arbitrarily estab- 
lished as two, three, five, eight, and ten 
times the standard error of measurement. 
On this basis it was usually possible to 
pair each child with a few other children 
at the same age level. The member of 
each pair showing the better performance 
on the prognostic test was predicted as 
the one most likely to gain from the 
special instruction. 
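
The pairing procedure can be sketched in a few lines. The following is a minimal illustration under several assumptions that go beyond the text: a difference is treated as "significant" when it is at least the chosen multiple of the standard error of measurement, a tie in gain is credited as half a correct prediction, and the five cases are invented for the example. The function name and the record layout are hypothetical.

```python
from itertools import combinations

SEM = 4.6  # standard error of measurement reported for the test, in errors


def paired_predictions(cases, multiple):
    """Count pairs and correct predictions.

    cases: list of (age_level, error_score, gain) records.
    A pair is formed only within an age level and only when the error
    scores differ by at least multiple * SEM (assumed criterion).
    """
    pairs = 0
    correct = 0.0
    for a, b in combinations(cases, 2):
        if a[0] != b[0] or abs(a[1] - b[1]) < multiple * SEM:
            continue
        pairs += 1
        # Fewer errors on the prognostic test means the better performance.
        better, other = (a, b) if a[1] < b[1] else (b, a)
        if better[2] > other[2]:
            correct += 1
        elif better[2] == other[2]:
            correct += 0.5  # assumption: a tie in gain counts as half
    return pairs, correct


# Invented cases: (age level, prognostic error score, gain in grade score)
cases = [(9, 20, 1.2), (9, 45, 0.6), (9, 80, 0.9), (10, 30, 1.0), (10, 33, 0.4)]
print(paired_predictions(cases, 2))   # pairs formed at 2 x SEM
print(paired_predictions(cases, 10))  # fewer pairs at 10 x SEM
```

Because each child may enter several pairs, the predictions are not independent of one another, which should be kept in mind when the counts are compared with chance.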

This procedure experimentally dupli- 
cates a very real and practical adminis- 
trative problem in the area of remedial 
reading. The person responsible for se- 
lecting remedial reading cases is faced 
constantly with the question, “Which of 
these two children can profit the most 
from remedial instruction?” In essence, 
this study has experimentally created a 
series of such possible choice situations. 
The statistical purpose of this portion of 
the study was to determine whether the 
proposed instrument would predict, any 
better than chance, which children would 
profit the most from instruction. 

Table 2 shows the results of this por- 
tion of the study. Using two times the 
standard error (2 X s,) as a significant 
difference between prognostic test scores, 
it was possible to set up 44 pairs of 
remedial cases. Of these 44 predictions, 
34 were correct—indicating a predictive 
efficiency of .54 (54% better than chance). 


TABLE 2
PREDICTIVE EFFICIENCIES OF EXPERIMENTAL
PROGNOSTIC TEST WITH CASES
GROUPED BY CHRONOLOGICAL AGE
(N = 26)

Differential          Pairs    Correct        Percentage    Predictive
(std. error units)    (n)      predictions    correct       efficiency
 2X                   44       34             77            .54
 3X                   39       31             80            .60
 5X                   25       20½            82            .64
 8X                   17       15             88            .76
10X                   12       11             92            .86


When higher values are used as a signifi- 
cant difference between scores, the num- 
ber of pairs becomes less and the accuracy 
of prediction steadily increases. Using a 
value of ten times the standard error 
provides 12 pairs and a predictive effi- 
ciency of .86. The steady increase in pre- 
dictive efficiency, as higher values are 
used for discrimination, is further evidence 
that the test is functioning predictively 
in this situation. 
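
The relation between percentage correct and predictive efficiency can be illustrated as follows. The definition used here is an inference, not something the article states: efficiency is taken as the percentage correct rescaled so that chance (50 per cent) is 0 and perfect prediction is 1. That reading matches the description "54% better than chance" for 77 per cent correct. The function name is hypothetical.

```python
def predictive_efficiency(percent_correct):
    """Rescale percentage correct so that chance = 0 and perfect = 1 (assumed)."""
    return (percent_correct - 50) / 50


# Percentages correct from Table 2, keyed by the differential in standard errors
percent_by_differential = {2: 77, 3: 80, 5: 82, 8: 88, 10: 92}

for multiple, pct in percent_by_differential.items():
    print(f"{multiple:>2}X  {pct}% correct  efficiency {predictive_efficiency(pct):.2f}")
```

On this definition the first four rows reproduce the tabled efficiencies of .54, .60, .64, and .76; for the 10X row the same arithmetic gives .84, slightly below the .86 reported.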

When these same cases are paired on 
the basis of grade level rather than age 
level it is possible to obtain more pairings, 
but there is a decrease in the predictive 
efficiency of the instrument. This is probably
due to the more heterogeneous nature
of the sample in terms of age. (At one 
time the experimental test was adminis- 
tered to a number of children at various 
ages, and it was found that the number 
of errors rapidly decreases with an in- 
crease in the age of the Ss.) If a value 
of five times the standard error is used 
as a significant difference between scores 
it is possible to set up 35 pairs. This 
yields a predictive efficiency of .60 (as
compared to .64 for 25 pairs when
grouped by age). A predictive efficiency
of .60 was the highest obtained when
pairings were made on the basis of grouping
by grade (as compared to .86 when
grouped by age).

Twenty-three of the cases used in this 
portion of the study had had Stanford- 
Binet Intelligence Tests administered at 
some time. Predictive efficiencies were 
computed for these data using the same 
procedure used to predict with the ex- 
perimental test. No predictive value for 
the Stanford-Binet was found in this 
sample. However, since this portion of 
the study was subjected to limited con- 
trol the results do need to be interpreted 
cautiously. One significant point is 
brought out by this comparison. It was 




quite difficult to obtain as large a number 
of pairings from the intelligence test data 
as from the experimental test data. This 
was due to the relatively narrow range 
of performance available for discrimina- 
tion between Ss on the intelligence test 
results as compared to the experimental 
test results. The range of I.Q.’s was approximately
30 points while the range of
performance on the experimental test was 
approximately 100 points for the same 
Ss. In both instances, quite coincidentally, 
the size of the standard error of measure- 
ment is nearly identical. 


CONCLUSIONS 


Within the limitations of this study it 
would appear that the Experimental 
Prognostic Test for Remedial Readers has 
demonstrated a predictive value in select- 
ing cases for remedial reading instruction. 
Further research will be required in order 
definitely to determine the degree of this 
value. At present, it would appear that 
the test shows a greater possibility for 
predicting success based on gains during 
a period of instruction than the techniques 
in common use. However, it should be 
noted that the research concerning the 
predictive value of the techniques in 
current use is incomplete. 

One is impressed by the paucity of 
research studies designed to consider some 
aspect of the prediction of success in 
remedial reading. Most studies which are 
referred to in this area have been origi- 
nally designed for some other purpose. For 
example, the writing and research which 
provides the basis for the use of intel- 
ligence tests as a predictor of success in 
remedial reading is primarily based on 
studies of concurrent validity obtained 
between intelligence and reading achieve- 
ment with groups of normals. These cor- 
relations have not been based on the 




predictive validity between the intelligence 
test scores of retarded readers and their 
subsequent gains in remedial reading. 
This does not mean to say that the studies 
were not appropriate for the purposes 
originally intended by the authors, but


the application of the results of such 
studies to the problem of selecting cases 
or predicting success in remedial reading 
will remain highly questionable until sub- 
jected to experimental verification. 


Received August 30, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


EFFECT OF AN AUDIO-VISUAL PHONICS AID 
IN THE INTERMEDIATE GRADES 


CAROLYN LUSER, EILEEN STANTON, AND CHARLES I. DOYLE
Loyola University, Chicago 


To test the effect of formal phonics
drill on a group basis, the present experiment
employed audio-visual aids consisting
of uniform recordings and individual
charts for each pupil in four experimental
rooms.*

The population was chosen from a lower 
socioeconomic area, with many semitran- 
sients. The sample contained many Ne- 
groes, Puerto Ricans, Mexicans, and a 
scattering of other nationalities. The area 
and population were selected in the hope 
of finding more than the average number 
of handicapped readers. This hope was 
realized when the pretests were scored. 
The average reading scores in all eight 
rooms were below the grade-level expect- 
ancy, and there were many seriously re- 
tarded readers. The number of drop-outs 
and absences confirmed the transient na- 
ture of the population. While nearly 300 
pupils were available for pretests, only 
214 completed the entire battery of tests 
at the conclusion of the experiment. 


*Permission for the use of these four 
schools was graciously given by Peter B.
Ritzma, Chairman, Special Projects Com- 
mittee, and Don C. Rogers, Assistant Super- 
intendent, Chicago Public Schools, and by 
Rev. David C. Fullmer, Assistant Superin- 
tendent, Chicago Catholic School Board. 
The District Superintendent, Douglas Van 
Bramer, and the principals and teachers in 
the four schools also rendered much coopera- 
tion. 

The authors are indebted to the World 
Book Company for permission to reprint 
Stanford Primary Reading Test, Form D. It 
appeared to the authors that fill-in responses 
would be more valuable for detailed analysis 
than the multiple-choice style adopted in 
the 1953 forms. 

The phonographs, phonics records, and 
pupil charts for the experimental rooms were 
donated by Bremner-Davis Phonics. 


Two public and two parish schools were 
available in the area selected. The grade 
levels chosen for the experiment were the 
third and fourth grades, where reading 
difficulties tend to become more evident, 
and where pupils’ competence to follow 
group test instructions may be assumed 
more safely than at lower levels. 

In each school an experimental room 
and a control room were selected. The 
criterion for the selection of the experi- 
mental room was the lower average IQ 
derived from the Kuhlmann-Anderson 
Test, Form D (sixth edition). This form 
not only fitted the average grade expect- 
ancy of the experimental and the control 
rooms, but had the further advantage that 
the content was about equally divided be- 
tween verbal and nonverbal subtests. 

Besides the intelligence tests, four 
achievement measures were secured for 
each pupil: Gray’s Oral Reading Paragraphs,
Stanford Primary Reading, Form
D (paragraph meaning and word meaning),
and the Marion Monroe form (written)
of the Ayres Spelling Scale (3).

After the pretests, each of the four ex- 
perimental rooms received 43 twenty- 
minute sessions of phonics drill with the 
phonograph records and individual pupil 
charts (1). These sessions were spaced 
three times a week for a period of 15 
weeks. The experiment was limited to 15 
weeks because the pretests had to wait 
for mid-year promotions, and the retests 
had to be completed before final examina- 
tions in June. No special motivation was 
given during the drill periods, aside from 
the encouragement offered in the records 
themselves. All the phonics sessions were
conducted by one of the workers who had 




administered the pretests, who was thus a 
familiar figure to the children. 

At the conclusion of the experiment, the 
entire population, experimental and con- 
trol, was retested with the complete bat- 
tery described above. Retests were con- 
ducted by the same examiners who 
administered the original tests. 


RESULTS


The data from both batteries were 
analyzed by two methods: by compu- 
tation of the standard error of the gains 
(2), and by chi square. The chi-square 
analysis was based on the assumption that 
a gain in individual scores in the experi- 
mental group substantially greater than 
the average gain evidenced by the control 
group would be valid to determine a 
cutting point. Accordingly, for the achieve- 
ment tests, a gain of .5 grade score in 3.5 
months was set as significant. For the 
K-A IQ, allowing for practice effect much 
greater than that shown by the control 
group, a gain of three IQ points was 
chosen. 
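
The chi-square analysis described above amounts to sorting pupils into those who did and did not reach the cutting point and comparing the two groups in a fourfold table. The sketch below shows that computation only in outline. The gain lists and the resulting counts are invented for illustration and are not the study's data, the uncorrected chi-square formula for a 2 x 2 table is assumed, and the function names are hypothetical.

```python
CRITICAL_05_DF1 = 3.841  # tabled chi-square value for 1 df at the .05 level


def count_at_or_above(gains, cut):
    """Split a list of gains into (reached the cutting point, did not)."""
    reached = sum(1 for g in gains if g >= cut)
    return reached, len(gains) - reached


def chi_square_2x2(a, b, c, d):
    """Uncorrected chi-square for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Invented gains in grade score: 100 pupils per group
experimental_gains = [0.6] * 55 + [0.2] * 45
control_gains = [0.6] * 35 + [0.2] * 65

a, b = count_at_or_above(experimental_gains, cut=0.5)
c, d = count_at_or_above(control_gains, cut=0.5)
chi2 = chi_square_2x2(a, b, c, d)
print(f"chi-square = {chi2:.2f}, exceeds .05 value: {chi2 > CRITICAL_05_DF1}")
```

A correction for continuity would lower the value somewhat; whether the authors applied one is not stated.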


While the chi-square test was a less 
refined and precise statistic than that 
based on standard error, it did in general 
corroborate the findings of the latter. The 
results of both methods are summarized in 
Table 1. 

The gains in three of the four achieve- 
ment scores seem to indicate clearly the 
effectiveness of the standardized audio- 
visual drill. Failure to show a similar 
gain in word meaning is understandable 
in view of the background of the children 
in this sample. A marked gain in word 
meaning would seem to depend on en- 
riched experience and resultant growth in 
vocabulary more than on mastery of 
phonics. 

It seems evident that the unexpected 
gain in IQ points does not indicate a real 
change in intelligence. It does serve to 
point up the fact that performance on a 
group test of intelligence is markedly de- 
pendent on the pupils’ familiarity with the 
printed word. It is suggested, however, 
that improved habits of attention from 
group drill sessions may have contributed 


TABLE 1
TEST SCORES OF EXPERIMENTAL AND CONTROL GROUPS BEFORE
AND AFTER PHONICS

[Values are given in printed order; ".." marks a cell that is not legible, and a digit followed by ".." is a partly legible entry.]

                     Experimental      Control
                     (N = 105)         (N = 109)        Net diff.                    Chi
Tests                Pre     Post      Pre     Post     in gain     ..      P        square    P
Oral Reading    M    3.0     3.76      2.97    3..      .46         ..      >.001    13.93     ..
                SD   1.15    1.40      1.13    1..
Parag. Mean.    M    2.84    3.61      2.90    3..      .37         ..      .01      8.57      ..
                SD   1.16    .92       ..      ..
Word Mean.      M    2.79    2.76      ..      ..       ..          ..      ..       1.22      ..
                SD   .78     .93       ..      ..
Spell.          M    2.79    3.26      2.76    ..       ..          2.35    .02      4.26      .05
                SD   .97     1.17      .84     ..
IQ              M    88.3    94.5      92.3    ..       5.50        4.91    >.001    23.28     >.001
                SD   11.2    11.9      11.0    ..

* Not significant




in part to more successful performance 
on the K-A retests of the experimental 
group. 

SUMMARY


A sample of 214 third and fourth 
graders from four schools in an under- 
privileged area was divided into two 
groups. A control group was measured 
against an experimental group which re- 
ceived 43 drill sessions in phonics, with 
uniform recordings and individual charts. 
When both groups were retested after 
the experiment, the experimental group 
showed gains on standard tests of oral 
reading, paragraph meaning, and spelling, 


which were significantly greater than the 
gains of the control group. Gain in word 
meaning apart from context was not sig- 
nificant. Retest performance on a paper- 
and-pencil intelligence test also showed a 
marked gain for the experimental group. 
REFERENCES 
1. Bremner, A. J., & Davis, Josephine. The
sound way to easy reading. Wilmette,
Ill.: Bremner-Davis Phonics, 1953.
2. McNemar, Q. Psychological statistics.
New York: J. Wiley, 1949.
3. Monroe, M. Children who cannot read.
Chicago: Univer. of Chicago Press, 1932.


Received September 19, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958


PREDICTING SCHOOL ACHIEVEMENT FOR BILINGUAL PUPILS*


JAMES G. COOPER 


Territorial College, Agana, Guam, Marianas Islands


Teachers and other school personnel of 
the Territory of Guam have long been 
aware of the need for adequate predictors 
of school achievement for their bilingual 
pupils. In December of 1956, Guam’s De- 
partment of Education authorized a study 
to discover to what extent, if any, currently
available tests would provide such
predictions. Guam is a territory whose 
cultural patterns are rapidly changing. 
From a Spanish type of church-dominated
society, from a military paternalism, 
Guam is becoming infused with current 
American ideas, trends, and practices. 
The local language, Chamorro, is slowly
giving way to English. However, English 
is spoken only in the school classroom, 
infrequently on the playground, and 
rarely in the home and community. Con- 
sequently, because of the unique cultural 
and language factors, currently available 
measures of intelligence must lie in the 
realm of questionable validity until dem- 
onstrated otherwise. 

This study endeavored to determine the 
predictive ability of six tests of intelli- 
gence for certain fifth-grade pupils of 
Guam. Only those tests which were wholly 
or partially performance or nonverbal 
were considered. In order to hold cultural 
factors constant, four relatively isolated 
communities were selected. The villages 
of Inarajan, Merizo, Talofofo, and Umatac 
have had no electricity (and consequently 
no television) until quite recently; no
statesiders (persons whose usual abode
is mainland, U.S.A.) as residents; no telephones;
few movies. Books and magazines
are not commonly found. These four villages
enrolled a total of approximately
180 fifth-grade children distributed among
six classes.

*A research supported by a grant from
the James McKeen Cattell Fund.

The plan of study entailed the following:

1. Administer three group tests of intelligence
to all fifth-grade pupils of the
four villages. The tests were the California
Tests of Mental Maturity, 1950 S-Form,
Elementary; Davis-Eells Games, Intermediate
Level; and the Culture Free Intelligence
Test, Scale 2, Form A.

2. Select a stratified, random sample
of 51 pupils from the larger group. Give
each of these pupils the following individual
tests: Leiter International Performance
Scale, Wechsler Intelligence Scale
for Children, and the Columbia Mental
Maturity Scale.

3. Give all pupils the California
Achievement Tests, Form AA, Elementary
level.

4. Obtain teacher ratings for each child
regarding his school success.

5. Relate intelligence test scores to
achievement test scores and to teachers’
ratings (applicable to group tests only).

6. Study the interrelationships between
certain tests.


PROCEDURE


The plan indicated above was followed.
The Davis-Eells Games and California
Tests of Mental Maturity were administered
between November 1956 and February
1957. The Culture Free Intelligence
Test was given during March of 1957
and the California Achievement Tests
given in May 1957. The teachers’ ratings
were obtained prior to achievement testing.
Individual tests were given from
February through June 1957 in this sequence:
Wechsler Intelligence Scale for


Children, Columbia Mental Maturity 
Scale, and Leiter International Performance
Scale.

The sample for individual testing was 


drawn via a table of random numbers. 


Each of the six classrooms was permitted 
to contribute its proper share of boys 
and girls. This was necessary because the 
schools of Inarajan and Merizo divided 
their fifth-grade pupils into fast and slow 
groups. Also, these schools enrolled twice 
as many fifth graders as Umatac and
Talofofo. 
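
The idea of letting each classroom contribute "its proper share of boys and girls" can be illustrated by proportional allocation within strata. The sketch below is not a record of how the sample was actually drawn, which was done from a table of random numbers: the stratum sizes are invented, the largest-remainder rule for rounding is an assumption, and a seeded pseudo-random generator stands in for the table. All names are hypothetical.

```python
import random


def proportional_allocation(stratum_sizes, sample_size):
    """Share a sample among strata in proportion to size (largest remainders)."""
    total = sum(stratum_sizes.values())
    exact = {k: sample_size * n / total for k, n in stratum_sizes.items()}
    allocation = {k: int(v) for k, v in exact.items()}
    leftover = sample_size - sum(allocation.values())
    # Give the remaining places to the strata with the largest fractional parts.
    by_fraction = sorted(exact, key=lambda k: exact[k] - allocation[k], reverse=True)
    for k in by_fraction[:leftover]:
        allocation[k] += 1
    return allocation


def stratified_sample(strata, sample_size, seed=0):
    """Draw a simple random sample of the allocated size within each stratum."""
    rng = random.Random(seed)
    sizes = {k: len(members) for k, members in strata.items()}
    allocation = proportional_allocation(sizes, sample_size)
    return {k: rng.sample(strata[k], allocation[k]) for k in strata}


# Invented strata: two classrooms, each split by sex
sizes = {"A boys": 20, "A girls": 16, "B boys": 10, "B girls": 8}
strata = {k: [f"{k} #{i}" for i in range(1, n + 1)] for k, n in sizes.items()}

allocation = proportional_allocation(sizes, 15)
sample = stratified_sample(strata, 15)
print(allocation)
```

With larger classrooms contributing proportionally more pupils, a school that enrolls twice as many fifth graders supplies about twice as many cases to the sample.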


Group Tests 


The principal findings from the three 
group tests of mental ability are shown 
in Table 1. The table shows that 164 
pupils obtained a mean California Test 
of Mental Maturity total IQ of 83.494 
with a standard deviation of 11.087; the 
Pearson correlation coefficient between 
these scores and the California Achieve- 
ment Tests was .644 and the stability of 
this coefficient is indicated by the .99 
confidence limits of .509-.747. These limits 
were computed via the z transformation 




(6, p. 147). Similar data are recorded 
for California Language IQ, Nonlanguage 
IQ, Davis-Eells Games, and Culture Free
Intelligence Test. The latter test is reported
in raw scores because the IQs
yielded a distribution severely skewed,
i.e., too many low scores. The use of
raw scores gave a normal (by eye) distribution.
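
The confidence limits quoted for the Total IQ correlation can be retraced from the z transformation. The sketch below is an illustration, assuming the usual procedure: r is converted to z, a band of 2.576 standard errors (the normal deviate for .99 limits) is laid off on either side with the standard error taken as 1/sqrt(N - 3), and the limits are converted back to r. The function name is hypothetical.

```python
import math


def fisher_confidence_limits(r, n, z_crit=2.576):
    """Confidence limits for a Pearson r via Fisher's z (.99 limits by default)."""
    z = math.atanh(r)
    half_width = z_crit / math.sqrt(n - 3)
    return math.tanh(z - half_width), math.tanh(z + half_width)


# Total IQ against California Achievement Tests: r = .644, N = 164
lower, upper = fisher_confidence_limits(0.644, 164)
print(f".99 limits: {lower:.3f} to {upper:.3f}")
```

The result agrees with the reported limits of .509-.747 to within a unit in the third decimal place, a difference that can arise from rounding in the tables used.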

A positive skew was noticed in the 
distribution of California Nonlanguage 
IQs. However, a chi-square test indicated 
that the obtained frequency distribution 
fitted the normal form adequately (3, 
pp. 284-285). 

The Davis-Eells Games scores were 
considerably lower than the other two 
tests. The possibility existed that this 
may have been caused by the pupils’ 
difficulties in understanding the direc- 
tions. (This test consists of a number of 
pictures about which the examiner makes 
various statements. The subject responds


by indicating which statement best ap- 
plies to each picture.) This possibility 


was explored by retesting a fifth-grade 
class in Merizo. The second test was given 


TABLE 1
CORRELATION COEFFICIENTS BETWEEN GROUP TESTS OF INTELLIGENCE
AND CALIFORNIA ACHIEVEMENT TEST RAW SCORES

Group Test                               M        SD       r    .99 confidence     N
                                                                limits (via z)
California Test of Mental Maturity
  Total IQ                             83.494   11.087   .644     .509-.747      164
  Language IQ                          81.024   10.194   .584     .434-.703      164
  Nonlanguage IQ                       88.067   17.239   .522     .359-.655      164
Davis-Eells Games (IPSA)               66.970   11.146   .531     .369-.601      164
Culture Free Intelligence Test
  (Raw scores) (this yields a
  mean IQ between 75 and 78)           20.685    6.355   .549     .391-.676      163
California Achievement Test
  Total raw score                     154.451   36.178                           164




wholly in Chamorros. The average in- 
crease of three points lacked statistical 
significance. It was concluded that this 
test measures as well in English as it 
does in Chamorros. 

The matter of sex differences was an- 
alyzed for each of the group tests. None 
of the small differences approached the 
5% level of significance. The mean CA 
of the boys was 12-0 and of the girls 
11-8; the difference of four months was 
significant at the 1% level.

Discussion of group tests. The data of 
Table 1 indicate that the California total 
IQ predicted California Achievement Test
scores fairly well; the schools of Guam 
should consider seriously more widespread 
application of this test. It was interesting 
to note that neither the Language nor 
Nonlanguage IQ’s should be used sep- 
arately. 

Neither the Davis-Eells Games nor the
Culture Free Intelligence Test offers as
much promise. However, from the point
of view of test theory, the fact that these 




two tests do show pronounced positive 
correlations with scores on a typical school 
achievement test is highly significant. In 
other words, the abilities sampled by 
these two measures are those which in 
part determine school success in a
bilingual setting.

Group tests and teachers’ ratings. The 
six teachers of the children tested were 
asked to rate their pupils according to 
the directions: “... Divide your pupils 
into three groups. Place the names of 
your best pupils into the top third, the 
poorest into the low third, and the others 
into the middle third.” The relationship 
between this sort and the four group 
tests was established by locating the 
median for each class and preparing a 
3 x 2 table from which chi square could 
be computed. A high value of chi square 
indicates that the teachers’ judgments and 
the pupils’ test positions (above or below 
the medians for their classes) were con- 
gruent. These data appear in Table 2. 
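
The chi-square computation described above can be sketched as follows. The counts are hypothetical (the article reports only the resulting chi-square values, not the cell frequencies), and the helper name `chi_square` is an assumption made for illustration. Rows stand for the teacher's top, middle, and low thirds; columns for pupils above and below the class median on a test.

```python
def chi_square(table):
    """Pearson chi-square for an r x c table of observed counts.

    Expected counts come from the row and column totals, as in the
    ordinary test of independence.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Hypothetical class of 30 pupils: [above median, below median] per third.
hypothetical_class = [[8, 2], [5, 5], [2, 8]]
stat, df = chi_square(hypothetical_class)
print(f"chi-square = {stat:.2f} with {df} df")
```

With 2 degrees of freedom the tabled critical values are 5.99 at the 5% level and 9.21 at the 1% level, so a class with counts like these would reach the 5% level but not the 1% level.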


TABLE 2
CHI-SQUARE RESULTING FROM COMPARING TEACHERS’ RATINGS
WITH TESTS OF ABILITY AND ACHIEVEMENT

                              Name of Test
                   Calif. Mental Maturity
                                       Non-     Davis-   Culture   Calif.
Teacher        N    Total    Language  language Eells    Free      Achievement
Inarajan
  Teacher A    29   9.48**   1.73      11.50**  2.82     5.93      19.08**
  Teacher B    [?]  7.42*    4.47       6.36*   2.74     2.84      10.11**
Merizo
  Teacher A    23   1.30     1.92       1.81    1.60     4.71       6.19*
  Teacher B    32   1.28     4.70       3.61    [?]      2.44      16.02**
Talofofo       [?]  4.49     7.63*      3.52    4.77     1.07      12.77**
Umatac         26   7.26*    3.22       9.96**  1.96    10.95**    17.91**

* Significant at 5% level.
** Significant at 1% level.




The table shows that the California 
Test of Mental Maturity IQ’s agreed 
fairly well with the ratings of three 
teachers and that the Nonlanguage IQ 
was as effective as the Total IQ. This
finding was surprising as teachers may be 
expected to give more weight to their 
pupils’ verbal behavior than to their non- 
verbal behavior. Neither the Culture Free 
Intelligence Test nor the Davis-Eells
Games corresponded well with teachers’ 
ratings. The California Achievement Test, 
however, showed a pronounced agreement 
with teachers’ ratings. These latter data 
demonstrate a high degree of validity for
this test, i.e., the California Achievement
Test seems to measure the kinds of 
achievements that these teachers deem 
important when they differentiate be- 
tween successful and less successful pupils. 
These data also suggest that this test 
may possess considerable curricular va- 
lidity. This point, however, should be 
verified by study of both teachers’ opinions
and actual curricular materials used
in these classes.


Individual Tests 


The results obtained from administering the three individual tests
of intelligence and correlating these scores with the total raw
scores from the California Achievement Tests appear in Table 3.
The table shows that the best prediction was given by the Verbal
Scale of the Wechsler Intelligence Scale for Children. The greatest
amount of scatter was given by the Columbia Mental Maturity Scale
as shown by its sigma of 24.37; this test also produced the highest
mean IQ.

The means of these individual tests were well below those given in
the respective manuals. The obtained sigmas were three to four IQ
points lower than those reported for the Wechsler Intelligence Scale
for Children (8), equal to the Columbia Mental Maturity Scale’s
listed 25 IQ points (1), and lower than the figures given for the
Leiter International Performance Scale (4). The latter was difficult
to determine, because the Leiter manual lacks specificity on this
point. These data indicate that the IQ scores for the Guam sample
were distributed in a manner similar to the standardization groups.


TABLE 3
PEARSON CORRELATION COEFFICIENTS BETWEEN CALIFORNIA ACHIEVEMENT
TESTS (RAW SCORES) AND IQS ON INDIVIDUAL TESTS
(N = 51)

Test                                         M       SD      r    .99 confidence
                                                                  limits (via z)
Wechsler Intelligence Scale for Children
  Full Scale                               72.89   11.84    .77      .58-.88
  Verbal Scale                             71.58   11.30    [?]      .63-.90
  Performance Scale                        77.15   12.70    [?]      [?]
Columbia Mental Maturity Scale              [?]    24.37    .61      .34-.79
Leiter International Performance Scale     72.78   12.58    .66      .40-.82
California Achievement Test (raw score,
  sum of all tests)                       155.39   43.16


Discussion of individual tests. The data revealed in Table 3 show
that the Verbal Scale of the Wechsler Intelligence Scale for
Children gives a fairly accurate prediction of school achievement
in Guam’s
bilingual setting. It was interesting to note 
that the Wechsler Full Scale IQ was not 
quite so efficient a predictor as was the 
Verbal Scale IQ. This finding was not 
supported by comparable evidence con- 
cerning the California Test of Mental 
Maturity reported in Table 1; in the 
latter case, the Language IQ and the 
Nonlanguage IQ predicted about equally 
well. Wechsler states that of the Verbal 
Scale subtests, “Information” and “Com- 
prehension” are poor for those with in- 
adequate verbal facility and that “Arith- 
metic” is influenced by education (7, pp. 
80-82). He also notes that “Vocabulary” 
is affected by schooling (7, pp. 98-99). 
Therefore, it seems reasonable to believe 
that in a bilingual setting, successful 
school achievers will obtain higher scores 
on both the Wechsler Verbal Scale and
on the California Achievement Test, i.e.,
we may be dealing with common elements 
rather than with intelligence per se. 

The data relating to the Leiter International Performance Scale are
of considerable concern. This test is expensive (about $200),
time-consuming to administer (from 45 to 90 minutes), and very
bulky. Its correlation with the California Achievement Test,
although high (r equaled .66), was only slightly higher than that
of the relatively inexpensive Columbia Mental Maturity Scale.
Further, the Columbia may be administered in from 10 to 15 minutes,
a factor of real importance to many psychometrists. On the positive
side, however, both the Leiter and the Columbia have demonstrated
considerable validity for use with these bilingual pupils. These
data are all the more significant when it is recalled that the
Leiter is a completely nonverbal test and the Columbia is almost so
(five cards on the Columbia require the ability to read words or
letters, and five involve numerals).


TABLE 4
INTERCORRELATIONS BETWEEN TESTS
(N = 51)

                                             Leiter International   Columbia Mental
Test                                         Performance Scale      Maturity Scale
California Test of Mental Maturity
  Total IQ                                         .68                   [?]
  Language IQ                                      .62                   [?]
  Nonlanguage IQ                                   .66                   .60
Wechsler Intelligence Scale for Children
  Full Scale IQ                                    .83                   .74
  Verbal Scale IQ                                  .73                   [?]
  Performance Scale IQ                             .78                   .68
Davis-Eells I.P.S.A.*                              .72                   .69
Culture Free Intelligence Test, Raw scores
  (N = 50)                                         [?]                   [?]
Columbia Mental Maturity Scale IQ                  [?]

* Index of Problem Solving Ability.


Interrelationships between tests. The Leiter International
Performance Scale and the Columbia Mental Maturity Scale are
relatively new arrivals to the scene

of mental measurements. In order to 
clarify the nature of their functioning, 
their scores were correlated with each 
other and with the other tests of mental 
ability utilized in this study. The results 
are shown in Table 4. The table indicates 
that the Leiter measures quite consistently
with the other tests, since the correlation
coefficients ranged from .62
(California Test of Mental Maturity,
Language IQ) to .83 (Wechsler Full Scale
IQ). The Columbia followed a pattern 
similar to the Leiter, but the coefficients 
were somewhat lower. 


SUMMARY AND CONCLUSIONS


This study was undertaken to ascer- 
tain to what degree, if any, currently 
available measures of intelligence predict 
school achievement for the bilingual pu- 
pils in the Territory of Guam. Three 
group tests, the California Test of Mental 
Maturity, 1950 S-Form, Elementary; the 
Davis-Eells Games, Intermediate Level; 
and the Culture Free Intelligence Test, 
Scale 2, Form A were given to 164 pupils 
in grade five. Three individual tests of 
intelligence: the Leiter International Performance
Scale, the Wechsler Intelligence Scale for Children,
and the Columbia Mental Maturity Scale were given
to a stratified, random sample of 51 pupils.
School achievement was defined primarily
by scores received on the California
Achievement Tests, Form AA,
Elementary Level, and secondarily by 
teachers’ ratings. 

All the intelligence tests correlated positively
with the California Achievement Tests. The
correlation coefficients ranged from .53 to .77
as follows: Davis-Eells Games, .53; Culture Free
Intelligence Test, .55; Columbia Mental Maturity
Scale, .61; California Test of Mental Maturity,
.64; Leiter International Performance Scale, .66;
and the Wechsler Intelligence Scale for Children,
Full Scale, .77.

Although teachers’ ratings corresponded 
well with rank on the achievement test,
they were not closely related to scores on 
the group intelligence tests. 

This study demonstrated that the six 
intelligence tests examined predicted 
school success with a degree of accuracy 
ranging from moderate to high for 
Guam’s bilingual pupils.

REFERENCES 


1. Burgemeister, B. B., Blum, L. H., & Lorge, I. Manual, Columbia
   Mental Maturity Scale. Yonkers-on-Hudson, N. Y.: World, 1954.

2. Cattell, R. B., & Cattell, A. K. S. Handbook for the Culture Free
   Intelligence Test, Scale 2. Champaign, Ill.: Institute for
   Personality and Ability Testing, no date.

3. Guilford, J. P. Fundamental statistics in psychology and
   education. (2nd ed.) N. Y.: McGraw-Hill, 1950.

4. Leiter, R. G. Part I of the manual for the 1948 revision of the
   Leiter International Performance Scale. Chicago: C. H. Stoelting,
   no date.

5. Lindquist, E. F. Statistical analysis in educational research.
   N. Y.: Houghton Mifflin, 1940.

6. McNemar, Q. Psychological statistics. (2nd ed.) N. Y.: Wiley,
   1955.

7. Wechsler, D. The measurement of adult intelligence. Baltimore:
   Williams & Wilkins, 1944.

8. Wechsler, D. Wechsler Intelligence Scale for Children; Manual.
   N. Y.: Psychological Corp., 1949.


Received October 1, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


THE EFFECT OF CHILD PSYCHOLOGY ON ATTITUDES TOWARD 
PARENT-CHILD RELATIONSHIPS*


FRANK COSTIN 
University of Illinois 


While most psychologists who teach in 
our colleges and universities are very 
much interested in changing their stu- 
dents’ attitudes, few of them are report- 
ing studies of the problem. In their review 
of research in the teaching of psychology, 
Birney and McKeachie (1) show the 
paucity of this kind of investigation. Fur- 
thermore, most of the attitude studies 
which they describe deal with the intro- 
ductory course. Little is being done to 
discover the nature of attitudes related to 
advanced courses in psychology, either on 
the graduate or undergraduate level. The 
study presented in this paper is intended 
to help close that gap. 

A second-level course which is becoming 
increasingly popular with undergraduates 
is child psychology. The students who 
take this course vary considerably in their 
educational and vocational goals, but al- 
most all of them share at least one com- 
mon objective: they hope to learn some- 
thing about children which will help them 
as future parents. They are especially in- 
terested in learning how their own atti- 
tudes might affect their children’s develop- 
ment. It is important, therefore, that 
those who teach this course find out what
kinds of attitudes toward parent-child 
relationships their students have, and 
how these attitudes change when they 
study child psychology. As a step in this 
direction, a recent pilot study by Hurley 
and Laffey (2), involving 19 students, 
concluded that a 10-week child psychology
course at Michigan State University
succeeded in making these students


*A briefer version of this paper was read 
on September 4, 1957, at the convention of 
the American Psychological Association in 
New York City. 




less rejecting in their attitudes toward 
children. There was no change in over- 
protecting attitudes. 

The present paper reports a more ex- 
tensive investigation of the same kind of 
problem explored by Hurley and Laffey. 
It attempts to answer this question: To 
what extent can an undergraduate, one- 
semester course in child psychology change 
students’ attitudes toward parent-child 
relationships? 


METHOD 


The subjects of this study consisted of 
four different classes totaling 157 stu- 
dents. None of them had previously taken 
a course in either child psychology, child 
development, or family relations. They 
represented a variety of majors within 
the College of Liberal Arts and Sciences 
at the University of Illinois, and were also 
drawn from other colleges within the
university. 

The content of the course was the same 
for each class, with emphasis on problems 
of parent-child interaction. All classes 
were taught by the same instructor. In 
addition to the use of a basic text and 
lectures as sources of information and at- 
titudes, films were shown and discussed. 
Supplementary readings also helped to 
extend the variety of course content. 

A parent-child attitude scale was ad- 
ministered to each class at the beginning 
and end of the course. The scale was also 
given at the beginning and end of the se- 
mester to 155 undergraduates at the same 
university who were enrolled in a one se- 
mester introductory course in sociology. 
This was done to see whether or not 
changes which might occur in the psychol- 




ogy students could be simply the result 
of general college living rather than tak- 
ing child psychology. Like the psychology 
classes, the sociology students represented 
a variety of majors within the College of 


Liberal Arts as well as other colleges.


They were approximately the same as the 
psychology students in age and college 
class. None had ever taken a course in 
child psychology, child development, or 
family relations, nor was any taking one 
at the times the scale was administered. 

The attitude scale was a slightly modi- 
fied version of one constructed and vali- 
dated by E. J. Shoben (4). Ten items 
from Shoben’s scale were omitted, be- 
cause they represented a miscellaneous 
group which lacked homogeneity, and 
therefore were not as meaningful for this 
study as the other items. In all other 
respects the scale was identical with Sho- 
ben’s. It consisted of 75 statements about 
parent-child relationships to which stu- 
dents responded by strongly agreeing, 
mildly agreeing, mildly disagreeing, or 
strongly disagreeing. The scale measured 
three kinds of attitudes: dominating (40 
items), possessive (20 items), and ig- 
noring (15 items). 

Dominating attitudes “reflect the ten- 
dency on the part of the parent to put 
the child in a subordinate role, to take 
him into account quite fully but always 
as one who should conform completely to 
parental wishes under penalty of severe 
punishment” (4, p. 137). The following 
statements from the scale represent this 
kind of attitude: “A child should have 
strict discipline in order to develop a fine, 
strong character.” “It is sometimes neces- 
sary for the parent to break the child’s 
will.” 

Possessive attitudes “reflect the ten- 
dency to ‘baby’ the child, to emphasize 
unduly (from a mental hygiene view- 
point) the affectional bonds between par- 
ent and child, to value highly the child’s 
dependence on the parent, and to restrict 


the child’s activities to those which can 
be carried on in his family group” (4, p. 
137). Statements illustrating this attitude 
are: “The best child is one who shows 
lots of affection for his mother.” “Children 
should always be loyal to their parents 
above anyone else.” 

Ignoring attitudes reflect “the ten- 
dency on the part of the parent to disre- 
gard the child as an individual member of 
the family, to regard the ‘good’ child as 
one who demands the least parental time, 
and to disclaim responsibility for the 
child’s behavior” (4, p. 137). Sample 
statements representing this kind of at- 
titude are the following: “Parents cannot 
help it if their children are naughty.” 
“Quiet children are much nicer than little 
chatter-boxes.” 

All scoring of responses followed Shoben’s
weighting system, a procedure he
had found to result in parents of problem 
children making significantly higher 
scores, on the average, than parents of 
nonproblem children. Scores were inter- 
preted as follows: The higher the score, 
the more intense is the attitude; the lower 
the score, the less intense is the attitude. 


RESULTS 


As Table 1 indicates, the psychology 
students expressed a significant decrease 
in the intensity of their attitudes, as 
measured by the total scale. The mean 
change was 7.63 scale points. This shows 
that in general the attitudes of the psy- 
chology students toward parent-child re- 
lationships became more permissive by 
the end of the course. 

The psychology classes also changed 
significantly in each of the three attitude 
dimensions which made up the total scale. 
Table 1 shows that the greatest change 
was in dominating attitudes, with a mean 
decrease of 5 scale points. Changes in pos- 
sessive and ignoring attitudes were 
smaller, being 1.51 and 1.12 scale points
respectively. 
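
The before-and-after comparison reported above can be sketched as a t test on correlated means. This is an assumption: the article gives only the mean changes and t values, not the formula used or the individual scores, so the data below are hypothetical and the function name `paired_t` is illustrative.

```python
import math
from statistics import mean, stdev


def paired_t(before, after):
    """Mean change and t ratio for paired before/after scores.

    Uses the difference-score form: t = mean(d) / (sd(d) / sqrt(n)),
    with n - 1 degrees of freedom.
    """
    diffs = [b - a for b, a in zip(before, after)]  # positive = decrease
    mean_change = mean(diffs)
    standard_error = stdev(diffs) / math.sqrt(len(diffs))
    return mean_change, mean_change / standard_error


# Hypothetical scale scores for four students (not data from the study).
before = [10, 12, 9, 11]
after = [8, 11, 7, 10]
mean_change, t = paired_t(before, after)
print(f"mean decrease = {mean_change:.2f}, t = {t:.2f}")
```

A positive mean change here corresponds to a decrease in attitude intensity, matching the direction of the differences listed in Table 1.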




These changes were evidently the result of taking child psychology,
since an analysis of the sociology classes showed no significant
change in attitudes. Table 2 reveals this lack of change between the
pretest and posttest scores. A comparison of the initial attitudes
of the sociology and psychology students shows that both groups
expressed essentially the same intensity of attitudes at the
beginning of their respective courses. Only the psychology students,
however, changed significantly. This fact supports the conclusion
that the psychology students changed as


TABLE 1
ATTITUDES TOWARD PARENT-CHILD RELATIONSHIPS BEFORE
AND AFTER COURSE IN CHILD PSYCHOLOGY
(N = 157)

                      Before course      After course     Difference
Attitude measured     Mean score   SD    Mean score   SD  in means       t
Dominating             154.55    11.23    149.55    10.18    5.00      7.46*
Possessive              73.50     6.30     71.99     5.82    1.51      3.75*
Ignoring                52.40     3.87     51.28     3.86    1.12      3.29*
Total scale            280.45    16.30    272.82    15.93    7.63      [?]

Note. The higher the score, the more intense is the attitude.
* p < .01.


TABLE 2
ATTITUDES TOWARD PARENT-CHILD RELATIONSHIPS OF STUDENTS
WHO DID NOT TAKE COURSE IN [CHILD PSYCHOLOGY]
(N = 155)

                      Before course      After course     Difference
Attitude measured     Mean score   SD    Mean score   SD  in means       t
Dominating             154.36    11.12    154.24    12.29     .12       .17
Possessive              74.44     5.66     74.21     5.04     .23       .56
Ignoring                51.41     4.45     51.58     4.27     .17       .45
Total scale            [entries partly illegible: 280.05, 16.23, 6.74]

Note. The higher the score, the more intense is the attitude.


TABLE 3
ATTITUDES TOWARD PARENT-CHILD RELATIONSHIPS OF STUDENTS ACHIEVING IN
UPPER AND LOWER HALVES OF COURSE IN CHILD PSYCHOLOGY

                       Before course      After course     Difference
                       Mean score   SD    Mean score   SD  in means      t
Upper half (N = 79)     275.84    14.56    268.54    14.46    7.30     5.54*
Lower half (N = 78)     285.65    16.11    277.29    15.93    8.36     [?]

Note. The higher the score, the more intense is the attitude.
* p < .01.




TABLE 4
MEN AND WOMEN STUDENTS’ ATTITUDES TOWARD PARENT-CHILD RELATIONSHIPS
BEFORE AND AFTER COURSE IN CHILD PSYCHOLOGY

                       Before course      After course
                       (total scale)      (total scale)
                       Mean       SD      Mean       SD    Difference    t
Men (N = 65)           [?]      15.83    272.79    15.73      8.26      [?]
Women (N = 92)         [?]      16.50    273.02    15.91      7.12      [?]

Note. The higher the score, the more intense is the attitude.
* p < .01.


a result of taking a course in child psy- 
chology, and not simply because of gen- 
eral college experiences. 

In order to discover what relationship 
might exist between scholastic achieve- 
ment in child psychology and changes in 
attitudes, a comparison was made be- 
tween students achieving in the upper 
half of their class and those achieving in 
the lower half. The basis for comparison 
was scores obtained on objective examina- 
tions covering course content, including 
a comprehensive final examination. Re- 
sults of this analysis are summarized in 
Table 3. Initially the upper half of the 
class was more permissive than the lower 
half. Both groups changed significantly. 
The amount of change, however, did not
differ significantly between the two groups.
(The actual difference of 1.06 has a t value of .59.)

Did men and women students differ in 
their attitude changes? The answer to 
this question can be found in Table 4. 
Initially, the attitudes of both groups 
were approximately the same. Both men 
and women showed a significant decrease 
in attitude intensity. The amount of
change, however, did not differ significantly
between men and women. (The actual difference
of 1.14 has a t value of .59.)


DISCUSSION


At the conclusion of a course in child 
psychology, students expressed more per- 
missive attitudes toward parent-child re- 


lationships than they had held at the be- 
ginning of the semester. While they 
changed significantly in all three kinds of 
attitudes measured, their dominating at- 
titudes decreased the most. Why did the 
greatest change occur in that particular 
area? There are several factors which 
should be considered in answering this 
question. All of them, to some extent, 
probably played a part in effecting 
change.


First, it may be that for the kind of 
population which made up the psychology 
classes, dominating attitudes are more 
susceptible to change through formal edu- 
cation than are possessive and ignoring 
attitudes. Most of these students were 
not very far removed in time and memory 
from their adolescence. Since problems of 
parental domination are so outstanding 
in typical adolescent-parent relationships, 
it may be that recent sensitization made 
these students more keenly aware of such 
problems. Thus they would be ready to 
react to any course content which de- 
picted the effects overdominating parents 
have on children. 

A second important factor is that the 
students may simply have examined more 
course content dealing with dominating 
attitudes, and thus were able to change 
more. Of course, this possibility is closely 
related to the first. They may have en- 
countered more material dealing with pa- 
rental domination because they were more 




ready to react to it, and therefore sought 
it out more often than information deal- 
ing with possessive and ignoring attitudes. 
(The instructor of the course is satisfied 
that neither his lectures nor assignments 
were more heavily weighted with material 
dealing with dominating attitudes than 
with other kinds of parent-child relation- 
ships.) 

A third likely reason for the greatest 
change being in dominating attitudes lies 
in the nature of the attitude scale. It will 
be recalled that the dominating area con- 
tained 40 items, as compared with 20 for 
possessive and 15 for ignoring attitudes. 
Thus, the dominating area of the scale 
afforded a wider range of possible re- 
sponses, and therefore more opportunity 
to express change. Furthermore, this part 
of the scale had a higher correlation with
the total scale (.86) than did the other
two attitude categories (4, p. 131). It is 
also significant to note that when Shoben 
compared the attitudes of parents of 
problem children with parents of non- 
problem children, he found the greatest 
difference in the two groups to be in 
dominating attitudes (4, p. 132). 

The fact that low achievers changed as 
much in their attitudes as high achievers 
confirms something teachers have strongly 
suspected for a long time: Grades don’t 
tell them all they would like to know 
about what students obtain from a 
course! However, had measurements of 
scholastic progress other than objective 
examinations been used, the relationship 
between change in attitude and achieve- 
ment might have been different. 

Evidently the course had the same ef- 
fect on the attitudes of men and women 
students, both groups becoming more per- 
missive to the same extent. This is a de- 
sirable outcome, since it is a commonplace 
that both parents play important parts in 
affecting their children’s development. 

Considering the great amount of delib- 
erate emphasis on parent-child relation- 


ships built into the course, the investiga- 
tor had anticipated greater changes in 
attitudes than those discovered. But per- 
haps it is unrealistic to expect in one se- 
mester much more change than this. 
Then, too, these measured changes may 
be only the initial stimulation for greater 
changes which will take place later in the 
students’ development. 

In any case, however, studies like this 
one would probably be more meaningful 
if they measured more specific attitudes 
than those represented by Shoben’s three- 
fold classification. The investigator is now 
in the process of doing this by using an 
inventory recently developed by Schaefer 
and Bell (3). Their instrument is an im- 
provement over Shoben’s in several re- 
spects, an important one being that its 
attitude areas, delineated on the basis of 
factor analysis, are more specific and ho- 
mogeneous than the areas used by Shoben. 
It is hoped that others interested in meas- 
uring attitudinal outcomes of child psy- 
chology will also try out this instrument. 


SUMMARY


Will an undergraduate course in child 
psychology change attitudes toward par- 
ent-child relationships? To answer this 
question, an attitude scale was adminis- 
tered to 157 students before and after 
the course. Postcourse attitudes were sig- 
nificantly less dominating, possessive, and 
ignoring. The greatest change was in dom- 
inating attitudes. Scholastic achievement 
in the course was not significantly related 
to the amount of change. Men and women 
did not differ significantly in how much 
they changed. As a control measure, the 
same scale was administered to 155 stu- 
dents before and after a course in intro- 
ductory sociology. No change in attitudes 
occurred. 


REFERENCES 


1. Birney, R., & McKeachie, W. The teaching of psychology: A survey
   of research since 1942. Psychol. Bull., 1955, 52, 51-68.

2. Hurley, R., & Laffey, J. Influence of a conventional child
   psychology course upon attitudes toward children. Papers of the
   Mich. Acad. Sci., Arts, and Letters, 1957, (1956 meeting), 299-306.

3. Schaefer, E. S., & Bell, R. Q. Parental attitude research
   instrument (PARI). Bethesda, Md.: Nat. Inst. of Mental Health,
   Child Development Section, no date (Mimeo.).

4. Shoben, E. J., Jr. The assessment of parental attitudes in
   relation to child adjustment. Genet. Psychol. Monog., 1949, 39,
   101-148.


Received October 7, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958 


THE DIFFERENTIAL EFFECT OF MANIFEST 
ANXIETY ON TEST PERFORMANCE*


EVAN W. PICKREL 
Air Force Personnel and Training Research Center


Though the Taylor-Spence anxiety- 
drive concept has received considerable 
attention in the experimental literature, 
there is a significant lack of information 
pertaining to the effect of this variable 
on psychometric performance. The thesis 
of this study, in accordance with evidence 
available in the experimental literature, is 
that manifest anxiety level may be shown 
to have a differential effect on various 
psychometric measures; and further, that
this differential effect may be shown to 
vary with the number of alternatives 
available for decision-making, the amount 
of habit interference possible within the 
task. The anxiety-drive hypothesis essen- 
tially states that the higher the anxiety 
score on the manifest anxiety scale (6), 
the stronger the excitatory potential and 
the greater the response strength. For ex- 
ample, a group scoring high on the scale 
has been demonstrated (3, 4, 5) to be 
superior to a low scoring group in the 
amount of conditioning exhibited. In more 
complex learning situations, however, in 
which there are a number of alternative 
competing tendencies, it has been shown 
(1) that the effect of increasing a drive 
would depend upon the initial response 
hierarchy and the relative habit strength 
of the correct or goal attaining response in 
the hierarchy. Differences in drive level 
may lead to superior performance by 
either the anxious or the nonanxious group 


* This report is based on work done under 
ARDC Project No. 7719, Task No. 17011, 
in support of the Research and Development 
Program of the Air Force Personnel and 
Training Research Center, Lackland Air 
Force Base, Texas. Permission is granted 
for reproduction, translation, publication, 
use, and disposal in whole and in part by 
or for the United States Government. 




depending on the difficulty of the choice 
points. 

It is therefore assumed that there would 
be no significant difference in the per- 
formance of anxious and nonanxious Ss 
on simple speeded measures which pose 
problems not having alternative com- 
peting tendencies. However, when the test 
problems offer a minimal number of al- 
ternative competing tendencies and thus 
are parallel to the experimentalist’s con- 
ditioning problem in terms of possible 
habit interference, the performance of 
anxious Ss will be superior to that of non- 
anxious Ss. Furthermore, when the meas- 
ures become quite complex and each 
problem offers a number of alternative 
competing tendencies, the performance of 
nonanxious Ss will be superior to that of 
anxious Ss. 


METHOD 
Measures 


Four measures were selected to test the 
hypotheses. Answer Sheet Marking Test 
B1710AX was chosen as a simple repeti- 
tive task which does not bring forth any 
alternative competing choices for solution 
of the problems. The test is used to de- 
termine how quickly and accurately Ss 
can locate and mark designated responses 
on a 15-choice IBM answer sheet. There 
are two parts, each presenting 75 ran- 
domly distributed items. 

Army Clerical Speed ACS-2 was selected 
as a measure in which the test problems 
offer a minimal number of alternative 
competing tendencies. The test presents 
125 four- to seven-digit numbers paired 
with either their reversals or a reasonable 
approximation. The task is to state 




44 EVAN W. PICKREL 


whether an exact reversal of the stimulus 
number is presented. 
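
The judgment asked of S on each item can be stated compactly. The sketch below is only an illustration in Python, not the Army's scoring procedure; the function name and the sample pairs are hypothetical and are not items from ACS-2.

```python
def is_exact_reversal(stimulus: str, paired: str) -> bool:
    """Return True when `paired` is the digit-for-digit reversal of `stimulus`."""
    return paired == stimulus[::-1]


# Hypothetical item pairs (stimulus number, number paired with it).
items = [
    ("48213", "31284"),    # exact reversal
    ("905716", "617590"),  # near miss: last two digits transposed
    ("7264", "4627"),      # exact reversal
]

# The keyed answer for each item: is the pair an exact reversal?
answers = [is_exact_reversal(stimulus, paired) for stimulus, paired in items]
print(answers)
```

The near-miss pair shows why the item allows a small number of competing responses: the paired number resembles a reversal closely enough to invite a wrong "yes."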

A Code Learning Test was developed 
to parallel the complex learning situations 
often used in experimental laboratories. 
The test utilizes a tape recorder and IBM
answer sheets for group administration, 
and teaches the first 10 letters of Inter- 
national Morse Code by presenting each 
signal twice, testing at the end of the se- 
quence, and repeating the procedure 10 
times. Part scores were obtained for each 
of the 20-item subtests. All Ss reporting 
previous experience with the code were 
eliminated from the experiment. 
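
The administration schedule described above can be laid out as an ordered list of events. This is a rough sketch under stated assumptions: the alphabetical presentation order, the tuple layout, and the name `build_session` are hypothetical, and the content of the 20-item subtests is not modeled because the report does not describe it. Only the Morse signals for A through J are standard.

```python
# International Morse Code for the first 10 letters of the alphabet.
MORSE_A_TO_J = {
    "A": ".-", "B": "-...", "C": "-.-.", "D": "-..", "E": ".",
    "F": "..-.", "G": "--.", "H": "....", "I": "..", "J": ".---",
}


def build_session(cycles: int = 10, presentations: int = 2):
    """Return the ordered events for one group administration."""
    events = []
    for cycle in range(1, cycles + 1):
        for letter, signal in MORSE_A_TO_J.items():
            # Each signal is presented twice (alphabetical order is assumed).
            events.extend(
                ("present", cycle, letter, signal) for _ in range(presentations)
            )
        # A test closes every teaching sequence; its items are not modeled.
        events.append(("test", cycle, None, None))
    return events


session = build_session()
tests_given = sum(1 for event in session if event[0] == "test")
```

Ten cycles yield ten part scores per S, which is the repeated-trials factor analyzed for this measure.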

A simple, nonspeeded arithmetic achieve-
ment test was selected as a complex meas-
ure of overlearning. California Achieve-
ment Test II, subtest 4, section D, of
parallel forms AA, BB, and CC was used.
Each form presents 20 addition problems
varying in complexity from the summation
of 2 two-digit to 4 four-digit whole num-
bers, decimals, fractions and complex num-
bers when presented in columnar and linear
array. Scores were obtained for each paral-
lel form of the test.


Subjects 


All Ss were newly inducted basic airmen 
entering Lackland Air Force Base from 
November 29, 1955 to January 20, 1956. A 
total of 723 Ss were administered a modi- 
fied form of the Taylor scale which in- 
cluded the 50 anxiety items of the original 
scale plus the Lie and Manic scales from
the Minnesota Multiphasic Personality In- 
ventory. The Lie scale was used to detect 
false scores, a score of eight or above 
eliminating an S from selection. The anx- 
ious and nonanxious groups each consisted 
of those whose scores fell respectively in 
the upper and lower 20% of scores for a 
standardization population of about 1000 
basic airmen. The cutoffs were 21 and 
higher, 10 and lower respectively on the 
manifest anxiety scale. There were 178 Ss 


in the anxious group and 159 Ss in the non- 
anxious group. 
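
The selection rule just described reduces to three cutoffs. The following sketch restates it in Python; the cutoff values are those given in the text, while the function and constant names are hypothetical.

```python
from typing import Optional

ANXIOUS_MIN = 21     # manifest anxiety score of 21 or higher
NONANXIOUS_MAX = 10  # manifest anxiety score of 10 or lower
LIE_CUTOFF = 8       # Lie score of eight or above eliminates an S


def assign_group(anxiety_score: int, lie_score: int) -> Optional[str]:
    """Return 'anxious', 'nonanxious', or None when S is not selected."""
    if lie_score >= LIE_CUTOFF:
        return None  # protocol treated as a false score
    if anxiety_score >= ANXIOUS_MIN:
        return "anxious"
    if anxiety_score <= NONANXIOUS_MAX:
        return "nonanxious"
    return None  # middle of the distribution, not used


print(assign_group(25, 3), assign_group(7, 2), assign_group(15, 1))
```

Under this rule an S with a high anxiety score is still dropped when the Lie score reaches the cutoff, which is why the two groups are smaller than 20% of the 723 Ss tested would suggest.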

Anxious and nonanxious Ss from each 
flight of approximately 60 men were pre- 
sented the above measures in the order 
described. 


RESULTS AND DISCUSSION


The means and standard deviations for 
anxious and nonanxious groups on the 
Answer Sheet Marking and Army Clerical 
Tests are presented in Table 1. The anal- 
ysis of variance technique was used to 
test for significant differences between per- 
formances of the two groups on the com- 
plex learning and achievement measures. 
This information is presented in Table 2. 
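
The degrees of freedom in Table 2 follow from the design: two groups of 178 and 159 Ss, measured on ten code learning trials or three arithmetic forms. The sketch below recomputes them as a check. It assumes a conventional groups-by-trials layout with Ss nested in groups; the function name and dictionary keys are hypothetical.

```python
def mixed_design_df(group_sizes, n_trials):
    """Degrees of freedom for a groups x trials design with Ss nested in groups."""
    n_groups = len(group_sizes)
    n_subjects = sum(group_sizes)
    between_groups = n_groups - 1
    between_trials = n_trials - 1
    within_groups = n_subjects - n_groups
    return {
        "between_groups": between_groups,
        "between_trials": between_trials,
        "individuals_within_groups": within_groups,
        "groups_by_trials": between_groups * between_trials,
        "residual": within_groups * between_trials,
    }


code_learning = mixed_design_df([178, 159], n_trials=10)
arithmetic = mixed_design_df([178, 159], n_trials=3)
print(code_learning)
print(arithmetic)
```

The residual terms come out as 3015 and 670, which agrees with the values printed in the table.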

In all cases the hypotheses were sub- 
stantiated. There was no significant differ- 
ence in the speed and accuracy with which 
anxious and nonanxious Ss could locate and 
blacken specified spaces on an IBM answer 
sheet. The mean performance score for 
anxious Ss was significantly higher than 
that of nonanxious Ss on the Army Clerical 
Speed Test. On the more complex activ- 
ities, the Code Learning Test and the 
arithmetic achievement tests, the mean 
performance scores of nonanxious Ss were 
significantly higher than those of anxious 
Ss.
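
The t values for the two speeded measures can be approximated from the summary statistics in Table 1. This is a sketch under assumptions: the report does not say which t formula was used, so an unpooled standard error is assumed here, and the pairing of each SD with a group follows the order in which the values appear in the table. The result changes very little if the SDs are swapped.

```python
import math


def t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """t ratio for two independent groups from means, SDs, and Ns (unpooled SE)."""
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / standard_error


# Anxious group first (N = 178), nonanxious group second (N = 159).
marking = t_from_summary(109.51, 23.24, 178, 112.04, 23.92, 159)
clerical = t_from_summary(70.70, 26.47, 178, 63.34, 27.75, 159)
print(round(marking, 2), round(clerical, 2))
```

The sign of the first ratio is negative only because the anxious mean is entered first; its magnitude matches the nonsignificant value reported for Answer Sheet Marking.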


Performance on the Army Clerical Speed 
Test parallels the results obtained with 
conditioning problems in the experimental 
laboratories. On the more complex ac- 
tivities, the code learning and arithmetic 
achievement tests, performance parallels 
the results obtained with more complex 
learning situations in the experimental 
laboratories. Since these results parallel 
those found with conditioning and complex 
learning situations, the appropriate place- 
ment of the control task on a complexity 
continuum is open to question. The amount 
of habit interference possible within this 
task does not seem as great as within the 
Army Clerical Speed Test. Yet perform-
ance on the Army Clerical Speed Test
paralleled the results of conditioning ex-
periments. Either the control task is mis-
classified on a complexity continuum or
the Taylor Scale does not function as a
measure of drive in this situation. A study
has been initiated to provide the answer
to this question.


TABLE 1
MEAN PERFORMANCE SCORES

                         Anxious (N = 178)    Nonanxious (N = 159)
Measure                     Mn       SD          Mn       SD          t

Answer Sheet Marking      109.51    23.24      112.04    23.92       .98
Army Clerical Speed        70.70    26.47       63.34    27.75      2.*

* Significant at the .02 level.


TABLE 2
ANALYSIS OF VARIANCE FOR TESTING SIGNIFICANT DIFFERENCES
BETWEEN PERFORMANCE OF THE TWO GROUPS

                                     Code Learning Tests    Calif. Achiev. Tests
Source                                 df     Variance        df     Variance

Between groups                          1                      1      114.08*
Between trials                          9     1396.94*         2      130.99*
Between individuals within groups     335      105.87*       335       40.02*
Groups by trial interaction             9        3.35          2        2.02
Residual                             3015        5.29        670        2.96

* Significant at the .01 level.


Previous studies have shown that an
extremely high scoring group on a mani-
fest anxiety scale is constantly superior to
a low scoring group in the amount of con-
ditioning exhibited but that in more com-
plex learning situations a high drive level
could result in an impairment of perform-
ance. In this study a control task was also
introduced, a situation in which there was
no significant difference between the per-
formance of anxious and nonanxious Ss.
Thus the differential effect of manifest
anxiety on performance was shown with
psychometric variables to be a function
of the number of alternatives for decision-
making given to S. Since results were ob-
tained which are consistent with those
found in experimental literature, the study
has demonstrated a parallelism between
psychometric measures and some types of
variables studied in experimental labora-
tories.


SUMMARY


Groups of anxious and nonanxious Ss, as
determined by a scale of manifest anxiety,
were administered psychometric tasks
which varied in the number of alternatives
for decision-making given to S. There was
no significant difference in the performance
of the two groups on a control measure
which did not provide S with alternatives
for decision-making. When the task pro-
vided a minimal number of alternatives,
mean performance of the anxious group
was superior. When the task was complex
and a number of alternatives were made
available, mean performance of the non-
anxious group was superior. Results were
obtained which are consistent with those 
found in the experimental literature, thus 
demonstrating a parallelism between psy- 
chometric measures and some types of 
variables studied in experimental labora- 
tories. 


REFERENCES 


1. Farber, I. E., & Spence, K. W. Complex
learning and conditioning as a function
of anxiety. J. exp. Psychol., 1953, 45,
120-125.

2. Sarason, S. B., Mandler, G., & Craig-
hill, P. C. The effect of differential
instructions on anxiety and learning.
J. abnorm. soc. Psychol., 1952, 47, 561-
565.

3. Spence, K. W., & Taylor, Janet. Anxiety
and strength of the UCS as determiners
of the amount of eyelid conditioning.
J. exp. Psychol., 1951, 42, 183-188.

4. Spence, K. W., & Taylor, Janet. The
relationship of anxiety level to per-
formance in serial learning. J. exp.
Psychol., 1952, 44, 61-64.

5. Taylor, Janet. The relation of anxiety to
the conditioned eyelid response. J. exp.
Psychol., 1951, 41, 81-92.

6. Taylor, Janet. A personality scale of
manifest anxiety. J. abnorm. soc. Psy-
chol., 1953, 48, 285-290.


Received October 11, 1957.


Journal of Educational Psychology
Vol. 49, No. 1, 1958


SOME SOCIOECONOMIC CORRELATES OF ACADEMIC APTITUDE
KHOSSROW MOHANDESSI AND PHILIP J. RUNKEL


Bureau of Educational Research, University of Illinois 


It is usually agreed that the socio- 
economic status of the family and com- 
munity in which children and youth grow 
up affects their learning behavior and 
school achievement (e.g., 2, 3, 5, 10).
Davis, for example, explains the influence 
of social class in these words: “By de- 
fining the group with which an individual 
may have intimate clique relationships, 
our social class system narrows his train- 
ing environment. His social instigations 
and goals, his symbolic world and its 
evaluation are largely selected from the 
narrow culture of that class with which he 
can associate freely” (2, p. 609). 

In the present study, academic aptitude 
of students in Illinois high schools is com- 
pared with some of the socioeconomic 
characteristics of various communities in 
which the schools are located. Variables 
chosen for this purpose were those for 
which there was ready-to-hand informa- 
tion and which tended to “narrow the 
culture” of the people in the community. 
They are the distance from large towns 
and cities, community population, dis- 
tance from the nearest active coal mine, 
the value of farm products sold, the value 
of land and building per farm, the school 
size, and whether the school is public or 
private. Obviously, these are not direct 
measures of the psychological effects of 
family and community but rather are 
rough indices which we presume will be as- 
sociated with psychological processes which 
in turn are related to academic aptitude. 
While these variables are not of the kind 
to please the theoretician, they have the 
advantage for the practical worker that 
the necessary information is readily avail- 
able. 

Although this study was made only of 
schools within the state of Illinois, there is 


evidence that the relationships reported 
here would be typical of those throughout 
the nation. In a nationwide sampling of 
high schools, Mollenkopf and Melville (7) 
found that academic aptitude and achieve- 
ment were related to such variables as re- 
gion (South or non-South), community 
size, percentage of fathers who were high 
school graduates, instructional support per 
pupil, percentage of support from state 
aid, and whether the region served by a 
school had a public library. 

Somewhat similar results were obtained 
by Thorndike (8) in his study of com- 
munity variables as predictors of intel- 
ligence and academic achievement. Cor- 
relations between IQ and various indices of 
the education of adult population ranged 
from .33 to .43 and between IQ and differ-
ent measures of cost of housing from .30
to .32. A correlation of .28 was obtained 
between IQ and proportion of native-born 
whites, one of —.26 between IQ and rate 
of female employment, and one of .28 
between IQ and frequency of professional 
workers in community. 

In the present study, the academic ap- 
titude of the students was measured by 
the portion of the Differential Aptitude 
Tests (1) which is used as a part of the 
Illinois Statewide High School Testing 
Program. The score used is the Total 
score from the Abstract Reasoning and
Verbal Reasoning Tests. This score will be
referred to as the “D.A.T.” in this paper. 
We will also talk of the Illinois Statewide 
Testing Program as the “Program.” 

From the 1955-56 Program we have 
figures available showing the mean D.A.T. 
for each school computed from the per- 

* This prepublication memorandum is 
-_ with the kind permission of the au- 

ors. 




centile scores of the pupils in that school. 
Using this information, we chose as our 
“dependent” variable an extreme dichot- 
omy; namely, whether the school mean 
lies in the top quarter of all schools or in 
the bottom quarter. The second and third 
quarters are ignored.* It should be re-
membered that not all high schools in the
state appear in the Program and corres- 
pondingly in this analysis. The schools in- 
cluded are those participating in the State- 
wide Testing Program and having school 
means on the D.A.T. falling either in the 
highest or the lowest quarters. 
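
The extreme dichotomy can be illustrated with a short sketch. It is not the authors' procedure: the school names and means are hypothetical, the function name is invented for the example, and the quartile boundaries come from Python's `statistics.quantiles` with its default method, which may differ slightly from the rule used by the Program.

```python
import statistics


def extreme_quarters(school_means):
    """Label schools in the top quarter 'first' and the bottom quarter 'fourth'.

    Schools in the two middle quarters are left out, as in the study.
    """
    lower, _median, upper = statistics.quantiles(list(school_means.values()), n=4)
    groups = {}
    for school, mean in school_means.items():
        if mean >= upper:
            groups[school] = "first"
        elif mean <= lower:
            groups[school] = "fourth"
    return groups


# Hypothetical school mean D.A.T. values (percentile units).
school_means = {"A": 10, "B": 20, "C": 30, "D": 40,
                "E": 50, "F": 60, "G": 70, "H": 80}
groups = extreme_quarters(school_means)
print(groups)
```

Dropping the middle half sharpens the contrast between groups, at the cost that nothing can be said about schools of average standing.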


HYPOTHESIS


The purpose of this study was to check 
some rough indices of the socioeconomic 
characters of different communities in the 
state of Illinois (or of the schools which 
are located in those communities) against 
the mean academic aptitude of the stu- 
dents sent to schools in those communities. 
Our general hypothesis is that some of the 
factors which determine aptitudes in 
schools in various communities are to be 
found among the socioeconomic character- 
istics of those communities. We would 
suppose that, while some of this relation-
ship is due to the efficacy with which cer-
tain types of social situations foster the 
growth of academic ability, another part 
of the relationship is due to the self-selec- 
tion of the kinds of families which move 
into the various kinds of communities. 
Our prediction in each case below stems 
from the expectation that the more a com- 
munity is able to support its schools, the 
greater are the chances that the students 
will show the higher ranges of aptitude. 

*Since cases falling in the second and 
third quarters are not examined, conclu- 
sions cannot be drawn about these quarters. 
We can only draw conclusions about whether 
a variable (such as distance from larger 
towns) is associated with the school’s fall- 
ing in the first or fourth quarter. 


TABLE 1
MEAN D.A.T. VERSUS DISTANCE FROM
NEAREST TOWN OF 25,000 OR LARGER
(N = 167)

Distance of schools from       Quarter of mean D.A.T. of school
towns of 25,000 or larger         First     Fourth

Less than 10 miles                 47         12
Over 10 to 25 miles                35         20
More than 25 miles                 12         41

Note. p < .001 by chi square for 2 df.


FINDINGS


A description of the socioeconomic meas- 
ures used in the study and the results 
obtained in each case follow. 


Distance from Town of 25,000 or Larger 


First examined was the distance of each 
school from the nearest town of 25,000 
or more in population. These distances 
were scaled from an ordinary current high- 
way map. Communities farther from the 
larger towns, we felt, would be more rural 
in character. Table 1 shows the distri- 
bution of schools in the first and last 
quarters of the mean D.A.T. scores ac- 
cording to their distance from the nearest 
town of 25,000 or more. The mean dis- 
tance from such towns is 12 miles for the 
schools in the first (highest) quarter of 
mean D.A.T. scores and 29 miles for the
schools in the fourth (lowest) quarter. 
The chi-square test applied to the data 
of Table 1 gives a value significant at the 
.001 level and supports the prediction
made.*
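
The test applied to Table 1 can be reproduced from the six cell frequencies. The sketch below is a plain Pearson chi square written out in Python for illustration; the function name is hypothetical, and the closed-form probability used at the end holds only for 2 degrees of freedom.

```python
import math


def chi_square(table):
    """Pearson chi square and degrees of freedom for a table of frequencies."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return statistic, df


# Table 1: rows are distance bands, columns are first and fourth quarters.
table_1 = [[47, 12], [35, 20], [12, 41]]
statistic, df = chi_square(table_1)

# For df = 2 the upper-tail probability has the closed form exp(-x / 2).
p_value = math.exp(-statistic / 2)
print(round(statistic, 2), df, p_value < 0.001)
```

The statistic is far above 13.82, the tabled value for the .001 level at 2 df, so the computation agrees with the significance level reported.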

It should be noted, in regard to Table 1 
and all other tables, that we chose the 
categories of the independent variable so 


* We report in each case the p value which 
is the smallest the table lists for the ob- 
tained value of chi square. We report these 
values so that the reader may choose for 
himself the level of probability which he 
wishes to consider statistically significant. 




SOCIOECONOMIC STATUS AND APTITUDE 49 


as to keep the frequencies in each category 
as equal as might be while at the same time 
avoiding expected frequencies too small to 
satisfy the requirements of the chi-square 
test. 
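
Whether a chosen set of categories avoids small expected frequencies can be checked before the test is run. The sketch below does this for the coal mine frequencies reported in Table 3. The minimum of 5 is a common rule of thumb and is an assumption here, since the authors do not state the figure they used; the function names are hypothetical.

```python
def expected_frequencies(table):
    """Expected cell frequencies under independence of rows and columns."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def smallest_expected(table):
    """Smallest expected frequency in the table."""
    return min(min(row) for row in expected_frequencies(table))


MIN_EXPECTED = 5.0  # assumed rule of thumb, not a value given in the article

# Table 3: distance from nearest active coal mine by quarter (first, fourth).
table_3 = [[16, 31], [18, 20], [33, 11], [27, 11]]
categories_ok = smallest_expected(table_3) >= MIN_EXPECTED
print(round(smallest_expected(table_3), 2), categories_ok)
```

A category scheme that fails the check would be repaired by merging adjacent categories, which is the trade-off between equal frequencies and adequate expected values that the text describes.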


Community Size 


A second variable used was the popu- 
lation of the communities in which the 
schools with higher and lower D.A.T. 
means were located. The assumption was 
that larger centers of population would 
be able to spend more money on their 
schools than the smaller ones. The relation 
was examined twice: once including schools 
in the Chicago metropolitan area and a 
second time excluding them. Population 
figures were taken from the highway map. 
Towns not listed on the map were assumed 
to have populations under 1,000. As shown 
by Tables 2(a) and 2(b), the expected re-
lation is significant at the .001 level
whether or not Chicago schools are in- 
cluded. 


Distance from Nearest Active Coal Mine 


The next prediction was that proximity 
to coal mines would tend to lower the 
mean aptitude to be found in the school. 
Table 3 shows the distribution of schools 
in the first and fourth D.A.T. quarters ac- 
cording to their distances from the near- 
est active coal mine. This variable was 
chosen because a great number of people, 
especially in the southern part of the state, 
make their living through working in the
coal mines. Some of the farmers work in 
them to earn “extra” money. Thus, exist- 
ence of coal mines becomes an important 
factor in shaping the economy of an area 
and in deciding the kind of population in- 
habiting certain localities, particularly in 
the southern half of the state. The appro- 
priate information was taken from a map 
of mineral industries (4). The relation 
predicted is significant at the .001 level. 


TABLE 2
MEAN D.A.T. VERSUS SIZE OF COMMUNITY

Population of the community      Quarter of mean D.A.T. of school
where schools are located           First     Fourth

(a) Including Chicago Schoolsᵃ (N = 167)

Under 1,000                          14         33
1,001-5,000                          25         28
5,001-25,000                         27         10
Over 25,000                          28          2

(b) Excluding Chicago Schoolsᵇ (N = 144)

Under 1,000                          14         33
1,001-5,000
Over 5,000

ᵃ p < .001 by chi square for 3 df.
ᵇ p < .001 by chi square for 2 df.


TABLE 3
MEAN D.A.T. VERSUS DISTANCE FROM
NEAREST ACTIVE COAL MINE
(N = 167)

Distance from nearest active     Quarter of mean D.A.T. of school
coal mine                           First     Fourth

Less than 20 miles                   16         31
Over 20-35 miles                     18         20
Over 35-50 miles                     33         11
Over 50 miles                        27         11

Note. p < .001 by chi square for 3 df.


Value of the Farm Products Sold in the
County


The value of the farm products sold in
different counties of the state was used
as an index of the income of the residents
of a given locality. The analysis was made
separately for counties containing towns
of 25,000 or over and for counties not
containing such towns in order to find out
whether there was any difference between
the two kinds of counties in this respect.
It was thought that where value of farm
products sold was higher, urban and rural
areas would both prosper and property
values would be high in general. It was
expected that schools with the higher
D.A.T. means would be located in com-
munities which showed the greater values
of farm products sold during the year.
Values were taken from the 1954 U. S.
Census of Agriculture (9). Tables 4(a)
and 4(b) present the relevant data. The
expected relation was significant for both
categories, i.e., counties with and without
larger towns, beyond the .05 level.


TABLE 4
MEAN D.A.T. VERSUS VALUE OF
FARM PRODUCTS

Value of farm products sold      Quarter of mean D.A.T. of school
in millions of dollars              First     Fourth

(a) Counties Containing Towns of 25,000 or Largerᵃ (N = 68)

0-14.99                               5          7
15.0-24.99                           30          4
25.0 and above                       17          5

(b) Counties Not Containing Towns of 25,000 or Largerᵇ (N = 99)

0-14.99                              15         29
15.0-24.99                           12         20
25.0 and over                        15          8

ᵃ p < .01 by chi square for 2 df.
ᵇ p < .05 by chi square for 2 df.


Value of Land and Building Per Farm


As a second index of income and there-
fore of the ability of rural people to give
financial support to their schools, the
average value of land and buildings per
farm was used. This likewise was computed
separately for counties containing towns
of 25,000 or over and counties not con-
taining such towns. The information was
taken from the 1954 U. S. Census of Agri-
culture (9). Tables 5(a) and 5(b) con-
tain the data relevant to this question. The
partitioning of chi square (6) was used
to test the significance of the relations
among the variables involved; i.e., quarter
of D.A.T., average value of land and
buildings per farm, and whether or not
the county contained any town of 25,000
and over. The only significant relation,
at the .001 level, was that between quar-
ter and location in counties with towns of
25,000 or over. This latter relation is of
little interest here since it is only another
form of the relation between mean D.A.T.
and the distance from larger towns. In
brief, our expectation was not borne out
that the mean D.A.T. of the school would
be related to the value of farm land and
buildings in the county. We have no
ready explanation of the reason that this
variable should fail to show a significant
relation while our other variables, just as
imprecise, do so.


TABLE 5
MEAN D.A.T. VERSUS AVERAGE VALUE OF
LAND AND BUILDINGS PER FARM

Value of land and buildings      Quarter of mean D.A.T. of school
per farm in thousands of
dollars                             First     Fourth

(a) Counties Containing Towns of 25,000 or Largerᵃ (N = 68)

Less than 45.0                       31          8
More than 45.0                       21          8

(b) Counties Not Containing Towns of 25,000 or Largerᵇ (N = 99)

Less than 45.0
More than 45.0

ᵃ p > .10.
ᵇ p > .10.


Size of School


Another variable used was the size of
the school itself. This information is avail-
able from the files of the Illinois Statewide
High School Testing Program. The pre-
diction was that schools in the first quar-
ter would be significantly larger in size
than the schools in the fourth quarter.
Table 6 contains the data relevant to this
point. The prediction is supported at the
.001 level.


TABLE 6
MEAN D.A.T. VERSUS SCHOOL SIZEᵃ
(N = 138)

Number of pupils in school       Quarter of mean D.A.T. in school
                                    First     Fourth

1-99
100-299
300-499
500-999
More than 1,000

Note. The information concerning this variable was
lacking for 29 schools in the Program.

ᵃ p < .001 for chi square for 4 df (computed with the
Yates correction for continuity).


TABLE 7
MEAN D.A.T. VERSUS PUBLIC-
PRIVATE VARIABLE
(N = 167)

Public-private variable

Public schools
Private schools

Note. p < .001 by chi square for 1 df.


The Public-Private Variable 


This measure was used as another varia-
ble which might correlate with mean ap-
titude in the school. It was predicted that
the ratio of private schools to public
schools would be significantly larger in the
first quarter of D.A.T. scores than in the 
fourth quarter. Here one would suppose 
the effect to be due largely to the selec- 
tiveness of the private school. Table 7 pre- 


sents the relevant data. The frequencies in 
Table 7 seem at first glance very “lop- 
sided.” The reason for this, of course, is 
that so many more public than private 
schools participate in the Illinois State- 
wide Testing Program. Nevertheless, the 
chi-square value for the relation in Table 
7 is significant at the .001 level.


SUMMARY AND CONCLUSION


The purpose of the present study was 
to investigate some socioeconomic cor-
relates of academic aptitude. The study
was intended to be exploratory rather 
than precise. Academic aptitude was ex- 
amined by using schools whose mean test 
scores were in the upper or lower quarters 
among schools participating in the State- 
wide Testing Program of the University 
of Illinois. Findings of the study were that 
the mean academic aptitude of the school 
was strongly related to (a) the distance 
from larger towns, (b) the community pop-
ulation, (c) the distance from active coal
mines, (d) the value of farm products sold 
in the county where the school is located, 
(e) the size of the school, and (f) whether 
the school is public or private.

REFERENCES 

1. Bennett, G. K., Seashore, H. G., &
Wesman, A. G. Differential aptitude
tests, Form A. New York: The Psy-
chological Corp., 1947.

2. Davis, A. Child training and social class.
In R. G. Barker, J. S. Kounin, & H. F.
Wright (Eds.) Child behavior and de-
velopment. New York: McGraw-Hill,
1943. Pp. 607-619.

3. Davis, A. American status systems and
the socialization of the child. In C.
Kluckhohn, & H. A. Murray, (Eds.)
Personality in nature, society, and
culture. New York: Knopf, 1949. Pp.
459-468.

4. Educational Extension Division, State
Geological Survey, Department of
Registration and Education, State of
Illinois. Mineral industries of Illinois.
(Map) Springfield: Author, 1955.

5. Empey, L. T. Social class and occupa-
tional aspiration: A comparison of ab-
solute and relative measurement.
Amer. sociol. Rev., 1956, 21, 703-709.


23 | 36
13. S|
| 4
| Quarter of mean
|
| First | Fourth:
| 2% | 2
|

6. Lancaster, H. O. Complex contingency 
tables treated by the partition of chi- 
square. J. Royal stat. Society, Series 
B, 1951, 13, 242-249. 

7. Mollenkopf, W. G., & Melville, S. D.
A study of secondary school character-
istics as related to test scores. Re-
search Bulletin No. 56-6. Princeton:
Educational Testing Service, 1956.

8. Thorndike, R. L. Community variables
as predictors of intelligence and aca-
demic achievement. J. educ. Psychol.,
1951, 42, 321-338.

9. U. S. Bureau of Census. U. S. census of
agriculture: Vol. 1, Counties and state
economic areas, Part 5, 1954. Washing-
ton: U. S. Government Printing Office,
1956.

10. Warner, W. L., Havighurst, R. J., &
Loeb, M. B. The social role of the
teacher. In T. M. Newcomb, & E. L.
Hartley, (Eds.) Readings in social
psychology. New York: Henry Holt,
1947. Pp. 479-480.


Received October 24, 1957.









